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PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington DC 20231 

Sir: 

Prior to initial examination, please amend the above- 
identified patent application as follows: 

IN THE SPECIFICATION : 

Page 1, after the title and before line 3, insert the 
following headings: 

-- BACKGROUND OF THE INVENTION 
1 . Field of the Invention - - 



PLAG GENE FAMILY AND 
TUMOR I GENE SIS 
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Page 1, line 5, start a new paragraph at "The invention 
in particular" and a new heading as follows: 
--2. Description of the Related Art --. 

Page 2, before line 19, insert the following headings and 
paragraphs : 

- - SUMMARY OF THE INVENTION 

The present invention is directed to the identification 
of and isolation of nucleic acids corresponding to the PLAG and 
CTNNB1 genes, translocation fusions thereof and derivatives 
thereof, including transcripts, cDNAs , sense and antisense 
nucleic acids, proteins derived from the genes, antibodies and 
fragments thereof which are directed to proteins corresponding 
to the genes and T-genes and derivatives thereof which are 
identified through use of the nucleic acids or derivative nucleic 
acids as a probe or primer. 

The present invention is further directed to methods for 
diagnosing abnormal expression of the PLAG and CTNNB1 genes, for 
inhibiting expression of these genes and for identifying 
homologous genes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a physical contig and STS marker map in 
proximal human 8ql2; 

Fig. 2 is a physical contig and STS marker map in distal 
human 8ql2; 



2 



Willem Jan Marie VAN DE VEN et al . 3 

Fig. 3A is a contig of 3 overlapping YACs (bold lines) , 

2 7 cosmids and 2 phages, containing 2 7 landmarks and spanning a 

3 00 kb DNA region on chromosome 8ql2 ; 

Fig. 3B shows mapping of the 8ql2 translocation 
breakpoint in an adenoma (CG644) with a t(3;8) (p21;ql2); 

Fig. 4A shows the cDNA and deduced amino acid sequence of 

PLAG1; 

Fig. 4B shows the alignment of the seven zinc-finger-like 
motifs found in the deduced amino acid sequence of PLAG1 relative 
to the C2H2 consensus motif; 

Fig. 5 shows an illustrative example of the detection of 
DNA rearrangement in PLAG1 of a primary pleomorphic adenoma using 
Southern blot analysis; 

Fig. 6A shows the detection of CTNNB1 / PLAG1 and 
PLAG1 / CTNNB1 fusion transcripts by RT-PCR in primary adenomas; 

Fig. 6B shows a schematic representation of the nature 
and origin of CTNNB1 /PLAG1 and PLAG1 / CTNNB1 fusion transcripts 
in primary pleomorphic adenomas with t(3;8) (p21;ql2); 

Fig. 7A shows a Northern blot analysis of the expression 
pattern of PLAG1 in normal human fetal tissues; 

Fig. 7B shows a Northern blot analysis of the expression 
pattern of the CTNNB1 gene in normal human fetal and adult 
tissues ; 

Fig. 7C shows detection of CTNNB1 / PLAG1 transcripts in 
pleomorphic adenomas by Northern blot analysis using exon 1 of 
CTNNB1 as a molecular probe; 

3 
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Fig. 7D shows detection of CTNNBl/PLAGl transcripts using 
a probe with specificity for the 3' UTR of PLAG1 (probe KK64) ; 

Fig. 8 shows the sequence of the PLAG2 cDNA with the open 
reading frame underlined ; 

Fig. 9 shows the nucleotide sequence of cDNA of CTNNB1 
(,6-catenin) ; and 

Fig. 10 shows STSs used to generate the 300 kb cosmid 
contig mapping at chromosome 8ql2 and encompassing PLAG1 . 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS - - . 

IN THE CLAIMS : 

Please cancel original claims 1-25 and insert new claims 
26-48 as follows: 

--26. A nucleic acid in isolated form, comprising one of 
an oligonucleotide, a polynucleotide and a gene having a 
nucleotide sequence of at least a part of a T-gene selected from 
the group consisting of the PLAG subfamily of zinc finger protein 
5 genes, the CNNB1 gene and fusions thereof, or complementary or 
degenerate versions of the nucleotide sequence. 

27. The nucleic acid as claimed in claim 26, wherein the 
nucleic acid is one of an oligonucleotide, a polynucleotide and 
a gene having a sequence of at least a part of a gene selected 
from the group consisting of PLAG1, PLAG2 and CNNB1 , sequences 
5 complementary thereto and degenerate sequences thereof . 
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28. The nucleic acid as claimed in claim 26, having 
homology with the zinc finger domains of the PLAG1 gene the 
nucleotide sequence of which is depicted in figure 4A, or the 
complementary strand thereof, including modified, degenerate or 
elongated versions of both strands. 

29. The nucleic acid as claimed in claim 26, comprising 
the nucleotide sequence of the PLAG1 gene as depicted in figure 
4A, or the complementary strand thereof, including modified, 
degenerate or elongated versions of both strands. 

30. The nucleic acid as claimed in claim 26, comprising 
the nucleotide sequence of the PLAG2 gene as depicted in figure 
8A, or the complementary strand thereof, including modified, 
degenerate or elongated versions of both strands. 

31. The nucleic acid as claimed in claim 26, comprising 
the nucleotide sequence of the CTNNB1 gene as depicted in figure 
9, or the complementary strand thereof, including modified or 
elongated versions of both strands. 

32. The nucleic acid as claimed in claim 26, wherein the 
nucleic acid is labeled. 

33. A raacromolecule comprising a derivative of a nucleic 
acid in isolated form, comprising one of an oligonucleotide, a 
polynucleotide and a gene having a nucleotide sequence of at 
least a part of a T-gene selected from the group consisting of 
the PLAG subfamily of zinc finger protein genes, the CNNB1 gene 
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and fusions thereof, or complementary or degenerate versions of 
the nucleotide sequence. 

34. The macromolecule as claimed in claim 33, wherein 
the derivative is selected from the group consisting of: 

a) a transcript corresponding to the nucleic acid; 

b) cDNA corresponding to the nucleic acid; 

c) sense or antisense DNA corresponding to the nucleic 

acid; 

d) a nucleic acid including a gene, or a derivative 
thereof, isolated by using at least a part of a T-gene as one of 
a probe or primer; 

e) a protein encoded by the nucleic acid; and 

f) antibodies, or derivatives thereof, directed to the 
nucleic acid, the transcript, the cDNA and the protein. 

35. The macromolecule as claimed in claim 34, wherein 
the derivative is labeled.' 

36. A diagnostic kit comprising one of a labeled nucleic 
acid as claimed in claim 2 6 and a labeled macromolecule 
derivative of the nucleic acid and one or more diagnostic 
reagents . 

37. The kit as claimed in claim 36, wherein the kit 
comprises labeled T-gene specific and tail specific PLR primers. 
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38. The kit as claimed in claim 36, wherein the 
macromolecule is a set of labeled nucleic acid probes and the kit 
further comprises a restriction enzyme. 

39. The kit as claimed in claim 36, wherein the 
macromolecule is a labeled probe for in situ diagnostics. 

40. An in situ diagnostic method for diagnosing 
interphase and/or metaphase cells having a non-physiological 
proliferative capacity, comprising the steps: 

a) designing a set of nucleotide probes based on a 
nucleic acid as claimed in claim 26, wherein at least one of the 
probes is hybridisable to a region of the aberrant gene 
substantially mapping at the same locus as a corresponding region 
of the wildtype gene and/or the same or another probe is 
hybridisable to a region of the aberrant gene mapping at a 
different locus than a corresponding region of the wildtype gene; 

b) incubating one or more interphase or metaphase 
chromosomes or interphase or metaphase cells having a non- 
physiological proliferative capacity, with the probe (s) under 
hybridising conditions; and 

c) visualising the hybridisation between the probe (s) 
and the gene . 

41. A method of diagnosing cells having a non- 
physiological proliferative capacity, comprising the steps of: 

a) taking a biopsy of a tumor to obtain cells to be 
diagnosed; 
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b) isolating a suitable T-gene-related macromolecule 
therefrom; 

c) analysing the macromolecule thus obtained by 
comparison with a wildtype reference molecule. 

42. The method as claimed in claim 41, comprising the 
steps of : 

a) taking a biopsy of a tumor to obtain cells to be 
diagnosed; 

b) extracting total RNA thereof ; 

c) preparing at least one first strand cDNA of the mRNA 
species in the total RNA extract, which cDNA comprises a suitable 
tail ; 

d) performing one of a PCR and a RT-PCR using one of a 
PLAG gene specific primer, a tail -specif ic and a partner- 
specific/nested primer to amplify PLAG gene specific cDNA's; 

e) separating the PCR products on a gel to obtain a 
pattern of bands ; 

f) evaluating the presence of aberrant bands by 
comparison to wildtype bands. 

43. The method as claimed in claim 41, comprising the 
steps of: 

a) taking a biopsy of a tumor to obtain cells to be 
diagnosed; 

b) isolating total protein therefrom; 
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c) separating the total protein on a gel to obtain 
essentially individual bands ; 

d) hybridising the bands thus obtained with antibodies 
directed against a part of the protein encoded by a remaining 

10 part of the T-gene and against a part of the protein encoded by 
a substitution part of the T-gene; 

e) visualising the ant igen- antibody reactions and 
establishing the presence of aberrant bands by comparison with 
bands from wildtype proteins. 

44. The method as claimed in claim 41, comprising at 
least some of the following steps: 

a) taking a biopsy of a tumor to obtain cells to be 
diagnosed ; 

5 b) isolating total DNA therefrom; 

c) digesting the DNA with a restriction enzyme; 

d) separating the digest thus prepared on a gel to 
obtain a separation pattern; 

e) hybridising the separation pattern in the gel or on 
10 the blot with a labeled nucleic acid in isolated form, comprising 

one of an oligonucleotide, a polynucleotide and a gene having a 
nucleotide sequence of at least a part of a T-gene selected from 
the group consisting of the PLAG subfamily of zinc finger protein 
genes, the CNNB1 gene and fusions thereof, or complementary or 
15 degenerate versions of the nucleotide sequence; and 

f) visualising the hybridisations and establishing the 
presence of aberrant bands by comparison to wildtype bands. 

9 
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45. A method for identifying a T-gene comprising the 
steps of : 

a) preparing one of a probe and a primer of a nucleic 
acid as claimed in claim 26; 

b) isolating a gene which hybridises to the probe or 

primer . 

46. A method for inhibiting expression of -a T-gene 
comprising contacting a cell with a derivative as claimed in 
claim 33, wherein the derivative is one of an antisense nucleic 
acid, a nucleic acid coding for an antisense molecule, or 
otherwise interferes with expression of a T-gene, and an antibody 
or a derivative thereof. -- 

IN THE ABSTRACT : 

After the claims, please insert a page containing the 
Abstract Of The Disclosure , which is attached hereto as a 
separately typed page. 

REMARKS 

Amendments have been made to the specification to place 
the application in conformance with standard United States Patent 
practice . 

Original claims 1-25 have been cancelled by this 
Preliminary Amendment. These claims have been rewritten as 
claims 26-46 in order to eliminate the multiple dependencies and 
to conform the claims to customary United States Patent practice. 
No new matter is added by the above-referenced amendments. 



Willem Jan Marie VAN DE VEN et al . 



11 



The SUMMARY OF THE INVENTION does not present new matter 



because it is a compilation of excerpts taken from various places 
in the as-filed specification. 



together with a diskette in computer readable format. 

An Abstract Of The Disclosure has been added as a 
separately typed page to be inserted after the claims. 

Examination and allowance of claims 26-46 are 
respectively requested. 



A Sequence Amendment will be filed at a later date 



Respectfully submitted, 



WEBB ZIESENHEIM BRUENING LOGSDON 



ORKIN & HANSON, P.C. 




•parbar^ Reg . No . 31 , 198 
"Attornef :or Applicants 
700 Koppers Building 
43 6 Seventh Avenue 
Pittsburgh, PA 15219-1818 
Telephone: 412/471-8815 
Facsimile: 412/471-4094 
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PLAG GENE FAMILY AND TDMORIGENESIS 



The present invention relates to the identification of 
the PLAG gene family as a family of genes frequently 
5 associated with tumorigenesis . The invention in particular 
relates to the identification of one member of this gene 
family that is involved in benign tumors such as those with 
chromosome anomalies involving a particular region of the 
long arm of chromosome 8, for instance but not restricted to 
10 pleomorphic adenomas of the salivary glands with involvement 
of chromosome 8ql2 and a translocation partner chromosome, 
very frequently chromosome 3 [i.e. t (3 ;8) (p21;ql2) ] . In 
addition, the invention relates to another member of this 
novel family as the gene whose expression is frequently 
15 abrogated in malignant tumors such as for example but not 
restricted to malignant salivary gland tumors. Furthermore, 
the invention concerns the identification of the CTNNB1 gene 
as a prototype of tumor-specific breakpoint region genes and 
frequent fusion partner of the PLAG genes. The invention 
20 relates in particular to the use of the members of the PLAG 
gene family and corresponding fusion partner genes as well 
as their derivatives in diagnosis and therapy. 

Multiple independent cytogenetic studies have firmly 
implicated chromosome region 8qll-13 in the following tumor 
25 types: pleomorphic adenomas of the salivary glands (high 
frequency) , pleomorphic adenomas of the lacrimal glands 
(high frequency) , lipoblastomas (high frequency) , solitary 
lipomas (low frequency) , rhabdomyosarcomas (low frequency) , 
and renal cell carcinomas (low frequency) . As an 
3 0 illustration, we here focus on pleomorphic adenomas of the 
salivary gland. This type of tumor constitutes a benign 
epithelial tumor that originates from the major and minor 
salivary glands. It is the most common type of salivary 
gland tumor and accounts for almost 50% of all neoplasms in 
3 5 these organs; 85% of the tumors are found in the parotid 
gland, 10% in the minor salivary glands and 5% in the 
submandibular gland. About 50% of these adenomas appear to 
have a normal karyotype but cytogenetic studies have also 
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revealed recurrent specific chromosome anomalies. Frequently 
observed anomalies include aberrations of chromosome 8, 
usually involving the 8ql2 region with as most common 
aberration a t(3;8) (p21;ql2) , and aberrations of chromosome 
5 12, usually translocations involving region 12ql3-ql5. Non- 
recurrent clonal chromosome abnormalities have also been 
reported. The highly specific pattern of chromosome 
rearrangements with consistent breakpoints at 8ql2 and 
12ql3-ql5 suggests that these chromosomal regions harbour 
10 genes that might be implicated in the development of these 
tumors . 

In previous physical mapping studies, the chromosome 
8ql2 breakpoint in pleomorphic adenoma of the salivary 
glands was mapped between MPS and PENK . In an effort to 

15 molecularly clone the gene affected by the chromosome 8ql2 
aberrations in the various tumors, the present inventors 
chose directional chromosome walking as a structural appro- 
ach to define the DNA region encompassing these breakpoints. 
As a starting point for chromosome walking, the MPS 

20 locus was used. During these walking studies, YAC clones 

corresponding to eight different loci in 8qll-12 were isola- 
ted and mapped by FISH analysis. Initially, the breakpoints 
were mapped within a 1 Mb region flanked by MPS proximal ly 
and by the genetic marker D8S166 distally. One YAC (CEPH 

25 166F4) within this region was shown to span the t(3;8) 

breakpoint in two tumors. Subsequently, it was shown by FISH 
that the chromosomal breakpoints as present in a number of 
primary pleomorphic adenomas were clustered within a small 
chromosomal segment which has been designated pleomorphic 

30 adenoma breakpoint cluster region (PA-BCR) . 

It has thus been found that essentially all breakpoints 
of chromosome 8ql2 map in a region indicated herein as PA- 
BCR. Further research revealed that in this region a member 
of the PLAG gene family, the PLAG1 gene, can be identified 

3 5 as the breakpoint target gene and postulated tumor aberrant 
growth gene. The PLAG gene family is a subclass of the zinc 
finger gene superfamily. The CTNNB1 gene was identified as 
the fusion partner gene of PIAG1 . Molecular analysis of the 
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chromosome breakpoints revealed that the chromosome translo- 
cations resulted in gene fusions. Fusions appeared to occur 
in the 5 ' -non-coding regions of both genes (PLAG1 and CTNN- 
Bl ) , exchanging regulatory control elements while preserving 
5 the coding sequences. Due to the t(3;8) (p21;ql2) , PIAGl is 
activated and expression of CTNNB1 is down-regulated. Acti- 
vation of PIAGl was also observed in adenomas with variant 
translocations, i.e. t(5;8) and t(8;15) . The results of the 
present inventors indicate that PIAGl activation due to 

10 promoter swapping is a crucial event in salivary gland 
tumor igenes is . 

The preferential fusion partner of PIAGl is the CTNNB1 
gene, which encodes B-catenin, a cytoplasmic protein of 
about 88 kD. Its frequent involvement in pleomorphic adeno- 

15 mas point towards a critical role of the CTNNB1 gene. The 
major role of CTNNB1 may be to provide an active promoter in 
front of the PIAGl gene. However, since the observed promo- 
ter swapping between PIAGl and CTNNB1 leads to down-regula- 
tion of CTNNB1 expression, it may also lead to other physio- 

20 logical consequences, especially since B-catenin is a pro- 
tein that has been implicated in highly diverse processes, 
such as human colon cancer, epithelial cell adhesion, embry- 
onal axis formation in Xenopus , and pattern formation in 
Drosophila . 

25 According to the present invention the aberrations in 

the PIAGl gene on chromosome 8 and the CTNNB1 gene on chro- 
mosome 3 have been used as a model to reveal the more gene- 
ral concept of the involvement of members of these gene 
families in tumorigenesis . Although both gene families are 

30 known per se, up till the present invention the correlation 
between these families and tumor inducing chromosome aberra- 
tions, like translocations, deletions, insertions and inver- 
sions, has not been anticipated. Furthermore, until now, it 
was not previously demonstrated that alterations in the 

35 physiological expression level of the members of the gene 
family are probably also implicated in tumor development. 
According to the invention, it was demonstrated that in 
normal adult cells the expression level of the PIAGl gene is 
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practically undetectable, whereas in aberrantly growing 
cells, the expression level is significantly increased. It 
appears that PLAG1 is a oncogene. 

The research that led to the present invention 
5 indicates that it is probable that abberations in the PIAGl 
gene, either in its sequence and/or its expression, will 
usually lead to a benign tumor. Furthermore, preliminary 
research indicated that expression of PLAGl is increased in 
various types of malignant salivary gland tumors. 
10 Another member of the FLAG gene family is the PLAG2 

gene. The present inventors have identified this gene, 
determined its nucleotide sequence and predicted its amino 
acid sequence. It was found that PLAG2 mapped to a region 
frequently deleted in malignant salivary gland tumors 
15 (6q24). Thus, contrary to PIAGl , PLAG2 may be a 
tumorsuppressor gene. 

The present invention now provides for a tool to 
investigate these theories and may ultimately lead to a 
means for distinguishing between benign and malignant 
20 tumors. This knowledge can then lead to more efficient 

methods of treatment. Thus, the present invention now provi- 
des for the members of the gene families or derivatives 
thereof in isolated form and their use in diagnostic and 
therapeutic applications. Furthermore, the knowledge on the 
25 location and nucleotide sequence of the genes may be used to 
study their rearrangements or expression and to identify a 
possible increase or decrease in their expression level and 
the effects thereof on cell growth. Based on this informati- 
on diagnostic tests or therapeutic treatments may be desig- 
3 0 ned . 

In this application, " PIAG " will be used to indicate 
the involvement of these types of genes in various types of 
tumors, not necessarily restricted to pleomorphic adenomas 
of the salivary glands. The term refers to all members of 
35 the PIAG gene family involved in non-physiological 

proliferative growth, and in particular involved in benign 
or malignant tumors. Members of the PIAG gene family show 
homology between zinc finger domains that are typical for 



WO 98/07748 PCT7EP97/04759 

5 

the PIAG1 gene. However, according to the invention it was 
furthermore found that even breaks outside the actual genes 
but in the vicinity thereof might result in aberrant growth. 
The term " PIAG gene" is therefore also intended to include 

5 the immediate vicinity of the gene. The skilled person will 
understand that the "immediate vicinity" should be 
understood to include the surroundings of the gene in which 
the breaks or mutations will result in the above-defined 
non-physiological proliferative growth. 

0 The term •» Tumor i genes is gene" or "T-gene" will be used 

to indicate all members of this novel PLAG gene family and 
their corresponding translocation or fusion partners, like 
CTNNB1. 

The term "wildtype cell" is used to indicate the cell 
5 not harbouring an aberrant chromosome. "Wildtype" or "nor- 
mal" chromosome refers to a non-aberrant chromosome. 

The present invention provides for various diagnostic 
and therapeutic applications that are based on the informa- 
tion that may be derived from the genes. This information 
0 not only encompasses its nucleotide sequence or the amino 
acod sequence of the gene product derived from the cene, but 
also involves the levels of transcription or translation of 
the gene. 

The invention is thus two-fold. On the one hand the 
5 aberration in cell growth may be directly or indirectly 

caused by the physical breaks that occur in the gene or its 
vicinity. On the other hand the aberration in cell growth 
may be caused by a non-physiological expression level of the 
gene. This non-physiological expression level may be caused 
0 by the break, or may be due to another stimulus that activa- 
tes or deactivate the gene. At present, the exact mechanism 
or origin of the aberrant cell growth is not yet completely 
unraveled. However, exact knowledge at this time is not 
necessary to define methods of diagnosis or treatment. 
5 Diagnostic methods according to the invention are thus 

based on the fact that an aberration in a chromosome results 
in a detectable alteration in the chromosomes' appearance or 
biochemical behaviour. A translocation, for example will 
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result in a first part of the chromosome (and consequently 
of a PLAG gene) having been substituted for another (second) 
part (further referred to as "first and second substitution 
parts") . The first part will often appear someplace else on 
5 another chromosome from which the second part originates. As 
a consequence hybrids will be formed between the remaining 
parts of both (or in cases of triple translocations, even 
more) chromosomes and the substitution parts provided by 
their translocation partners. Since it has now been found 

10 that the breaks occur in a FLAG gene, this will result in 
hybrid gene products of that PLAG gene. Markers, such as 
hybridising molecules like RNA, DNA or DNA/RNA hybrids, or 
antibodies will be able to detect such hybrids, both on the 
DNA level, and on the RNA or protein level. 

15 For example, the transcript of a hybrid will still 

comprise the region provided by the remaining part of the 
gene/ chromosome but will miss the region provided by the 
substitution part that has been translocated. In the case of 
inversions, deletions and insertions the gene may be equally 

20 afflicted. 

Translocations are usually also cytogenetically detec- 
table. The other aberrations are more difficult to find 
because they are often not visible on a cytogenetical level. 
The invention now provides possibilities for diagnosing all 

25 these types of chromosomal aberrations. 

In translocations markers or probes based on the PLAG 
gene for the remaining and substitution parts of a chromoso- 
me in situ detect the remaining part on the original chromo- 
some but the substitution part on another, the translocation 

3 0 partner . 

In the case of inversions for example, two probes will 
hybridise at a specific distance in the wildtype gene. This 
distance might however change due to an inversion. In situ 
such inversion may thus be visualized by labeling a set of 
35 suitable probes with the same or different detectable mar- 
kers, such as fluorescent labels. Deletions and insertions 
may be detected in a similar manner. 
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According to the invention the above in situ applicati- 
ons can very advantageously be performed by using FISH 
techniques. The markers are e.g. two cosmids one of which 
comprises exon 1 and the upstream region of the PLAG gene, 
5 while the other comprises the last exon and its downstream 
region. Both cosmids are labeled with different fluorescent 
markers, e.g. blue and yellow. The normal chromosome will 
show a combination of both labels, thus giving a green 
signal, while the translocation is visible as a blue signal 

10 on the remaining part of one chromosome (e.g. 8) while the 
yellow signal is found on another chromosome comprising the 
substitution part. In case the same labels are used for both 
probes, the intensity of the signal on the normal chromosome 
will be 100%, while the signal on the aberrant chromosomes 

15 is 50%. In the case of inversions one of the signals shifts 
from one place on the normal chromosome to another on the 
aberrant one. 

In the above applications a reference must be included 
for comparison. Usually only one of the two chromosomes is 

20 afflicted. It will thus be very convenient to use the normal 
chromosome as an internal reference. Furthermore it is 
important to select one of the markers on the remaining or 
unchanging part of the chromosome and the other on the 
substitution or inverted part. In the case of the PIAG1 gene 

25 of chromosome 8, breaks are usually found in the 5«-UTR as 
is shown by the present invention. Probes based on the 5'- 
and 3»-UTR are thus very useful. As an alternative a combi- 
nation of probes based on both translocation or fusion 
partners may be used, e.g. 5'-UTR of PLAG1 and 3»-UTR of 

30 CTNNB1. 

"Probes" as used herein should be widely interpreted 
and include but are not limited to linear DNA or RNA 
strands. Yeast Artificial Chromosomes (YACs) , or circular 
DNA forms, such as plasmids, phages, cosmids etc.. 
35 These in situ methods may be used on metaphase and 

interphase chromosomes . 

Besides the above described in situ methods various 
diagnostic techniques may be performed on a more biochemical 
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level, for example based on alterations in the DNA, RNA or 
protein or on changes in the physiological expression level 
of the gene. 

Basis for the methods that are based on alterations in 

5 the chromosome's biochemical behavior is the fact that by 
choosing suitable probes, variations in the length or compo- 
sition in the gene, transcript or protein may be detected on 
a gel or blot. Variations in length are visible because the 
normal gene, transcript (s) or prdtein(s) will appear in 
10 another place on the gel or blot then the aberrant one(s) . 
In case of a translocation more than the normal number of 
spots will appear. 

Based on the above principles the invention thus rela- 
tes to a method of diagnosing cells having a non-physiologi- 
15 cal proliferative capacity, comprising the steps of taking a 
biopsy of the cells to be diagnosed, isolating a suitable 
PLAG gene-related macromolecule therefrom, and analysing the 
macromolecule thus obtained by comparison with a reference 
molecule originating from cells not showing a non-physiolo- 

0 gical proliferative capacity, preferably from the same 

individual. The PLAG gene-related macromolecule may thus be 
a DNA, an RNA or a protein. The PLAG gene may be either a 
member of the PIAG1 family or of the translocation partner 
gene family of which the CTNNB1 gene is the prototype. 

5 In a specific embodiment the diagnostic method of the 

invention comprises the steps of taking a biopsy of the 
cells to be diagnosed, extracting total RNA thereof, prepa- 
ring a first strand cDNA of the mRNA species in the total 
RNA extract or poly-A-selected fraction (s) thereof, which 

0 cDNA comprises a suitable tail; performing a PCR using a 
PLAG gene-specific primer and a tail-specific primer in 
order to amplify PLAG gene-specific cDNA's; separating the 
PCR products on a gel to obtain a pattern of bands; evalua- 
ting the presence of aberrant bands by comparison to wildty- 

5 pe bands, preferably originating from the same individual. 
As an alternative amplification may be performed by 
means of the Nucleic Acid Sequence-Based Amplification 
(NASBA) technique [Compton, J. (1991) Nucleic acid sequence- 
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based amplification. Nature 3 50,91-92] or variations 
thereof . 

In another embodiment the method comprises the steps of 
taking a biopsy of the tumor to obtain cells to be 
5 diagnosed, isolating total protein therefrom, separating the 
total protein on a gel to obtain essentially individual 
bands, optionally transfering the bands to a Western blot, 
hybridising the bands thus obtained with antibodies directed 
against a part of the protein encoded by the remaining part 
10 of the PLAG gene and against a part of the protein encoded 
by the substitution part of the PLAG gene; visualising the 
antigen-antibody reactions and establishing the presence of 
aberrant bands by comparison with bands from wildtype 
proteins, preferably originating from the same individual. 
15 In a further embodiment the method comprises taking a 

biopsy of the tumor to obtain cells to be diagnosed; 
isolating total DNA therefrom; digesting the DNA with one or 
more so-called "rare cutter" (typically "6- or more 
cutters") restriction enzymes; separating the digest thus 
20 prepared on a gel to obtain a separation pattern; optionally 
transfering the separation pattern to a Southern blot; 
hybridising the separation pattern in the gel or on the blot 
with a set of probes under hybridising conditions; 
visualising the hybridisations and establishing the presence 
25 of aberrant bands by comparison to wildtype bands, 
preferably originating from the same individual. 

Changes in the expression level of the gene may be 
detected by measuring mRNA levels or protein levels by means 
of a suitable probe. 
3 0 Diagnostic methods based on abnormal expression levels 

of the gene may comprise the steps of taking a sample of the 
tumor to obtain cells to be diagnosed; isolating mRNA 
therefrom; and establishing the presence and/or the 
(relative) quantity of mRNA transcribed from the PLAG gene 
3 5 of interest in comparison to a control. Establishing the 
presence or (relative) quantity of the mRNA may be achieved 
by amplifying at least part of the mRNA of the PLAG gene by 
means of RT-PCR or similar amplification techniques. In an 
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alternative embodiment the expression level may be 
established by determination of the presence or the amount 
of the gene product (e.g. protein) by means of for example 
monoclonal antibodies. 
5 The diagnostic methods of the invention may be used for 

diseases wherein cells having a non-physiological prolifera- 
tive capacity are selected from the group consisting of 
benign tumors, such as the tumors pleomorphic adenomas of 
the salivary glands, lipoblastomas, uterine leiomyomas, and 
10 other benign tumors as well as various malignant tumors, 

including but not limited to sarcomas (e.g. rhabdomyosarco- 
ma) and leukemias and lymphomas. 

As already indicated above, it has been found that in 
certain malignant tumors the expression level of the PLAG 
15 genes is increased. Another aspect of the invention thus 
relates to the implementation of the identification of the 
PLAG genes in therapy. The invention for example provides 
anti-sense molecules or expression inhibitors of the PLAG 
gene for use in the treatment of diseases involving cells 

2 0 having a non-physiological proliferative capacity by modula- 

ting the expression of the gene. 

The invention thus provides derivatives of the PLAG 
gene and/or its immediate environment for use in diagnosis 
and the preparation of therapeutical compositions, wherein 
25 the derivatives are selected from the group consisting of 
sense and anti-sense cDNA or fragments thereof, transcripts 
of the gene or fragments thereof, antisense RNA, triple 
helix inducing molecule or other types of "transcription 
clamps", fragments of the gene or its complementary strand, 

3 0 proteins encoded by the gene or fragments thereof, protein 

nucleic acids (PNA) , antibodies directed to the gene, the 
cDNA, the transcript, the protein or the fragments thereof, 
as well as antibody fragments. Besides the use of direct 
derivatives of the genes and their surroundings (flanking 
35 sequences) in diagnosis and therapy, other molecules, like 
expression inhibitors or expression enhancers, may be used 
for therapeutic treatment according to the invention. An 
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example of this type of molecules are ribozymes that destroy 
RNA molecules. 

Bsides the above-described therapeutic and diagnostic 
methods, the principles of the invention may also be used 
5 for producing a transgenic animal model for testing pharma- 
ceuticals for treatment of PLAG -related malignant or benign 
tumors. One of the examples describes the production of such 
an animal model. 

It is to be understood that the principles of the 

10 present invention are described herein for illustration 
purposes only with reference to the PIAG1 gene mapping at 
chromosome 8ql2 and the CTNNB1 gene on chromosome 3p21. 
Based on the information provided in this application the 
skilled person will be able to isolate and sequence corres- 

15 ponding genes of the gene family and apply the principles of 
this invention by using the gene and its sequence without 
departing from the scope of the general concept of this 
invention . 

The present invention will thus be further elucidated 
20 by the following examples which are in no way intended to 
limit the scope thereof. 

EXAMPLES 

2 5 EXAMPLE 1 

1 . Introduction 

This example describes the isolation and analysis of 
overlapping YAC clones and the establishment of a YAC contig 
(set of overlapping clones) , which spans genomic DNA around 

30 the MPS locus and includes the translocation breakpoints, 
t (3;8) (p21;ql2) , of pleomorphic salivary glands (Fig. 8). 
Pleomorphic adenomas are benign epithelial tumors 
originating from the major and minor salivary glands. Cyto- 
genetic analysis has indicated that these tumors mainly 

35 display chromosome breakpoints in region ql2 of chromosome 
8. In previous studies we have shown that these breakpoints 
are located in the 9 cM interval between MOS / D8S285 and 
D8S260. Here we report directional chromosome walking stu- 
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dies starting from D8S260 and D8S285, resulting in, respec- 
tively, a rudimental YAC contig of 5 Mb and a highly detai- 
led YAC contig and long-range physical map of about 2 Mb. 
The latter YAC contig has at least double coverage and 
5 consists of 34 overlapping YAC clones isolated from the CEPH 
and ICRF YAC libraries. Their insert sizes were estimated by 
contour-clamped homogeneous electric field (CHEF) gel elec- 
trophoresis. Chromosomal localization of the YACs was inves- 
tigated by FISH analysis. On the basis of YAC insert and 

10 insert-end derived DNA markers and (publicly known) sequence 
tag sites (STSs) , with an average spacing of 65 kbp, as well 
as restriction enzyme analysis, a long-range physical map 
was established for this centromeric region of chromosome 
8ql2. Within the YAC contig, the relative positions of 

15 various known genes and expressed sequence tag sites was 
determined. This YAC contig constitutes the basis for the 
construction of a transcriptional map of this region. FISH 
analysis using YACs and cosmids, corresponding to STSs in 
this YAC contig, already revealed that all but two of the 

2 0 chromosome 8ql2 breakpoints in the primary tumors that were 
tested, mapped within a 3 00 kb interval between the MOS 
proto-oncogene and STS EM156. 

2 . Materials and methods 
25 2.1. Pleomorphic adenomas of the salivary glands and cytoge- 
netic analysis. 

Primary pleomorphic adenomas of the salivary glands 
were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were established and 
30 analyzed as described before (Roijer et al. f 1996). The 

primary tumors with their aberrations used in this study are 
listed in Table 1. Slides for FISH were prepared from cells 
stored in fixative at -20 °C. 

35 2.2. Fluorescence in situ hybridization (FISH) 

Slides for FISH were aged for 2-10 days, or were trea- 
ted with 2x SSC (pH 7.0) at 37°C for 30 min, and subse- 
quently denatured in 70% formamide in 2x SSC (pH 7.0), at 
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75 °C for 2-6 minutes. Cosmid and YAC DNAs (1 fig) were labe- 
led with biotin-16-dUTP (Boehringer Mannheim) by nick trans- 
lation. Labeled DNA was purified through a Sephadex G50 
column (Pharmacia) , coprecipitated with 50 mg sonicated 
5 human placenta DNA (Sigma) , and dissolved in hybridization 
solution (50% formamide, 2x SSC pH 7.0, 50 mM sodium phosp- 
hate pH 7.0, 10% dextran sulfate). Hybridization and probe 
detection were carried out as previously described (Roijer 
et al. # 1996). The alpha-satellite probe D8Z2 was obtained 

10 from Oncor. Slides were examined in a Zeiss Axiophot 
epif luorescence microscope using the appropriate filter 
combinations. Fluorescence signals were digitalized, 
enhanced and analyzed using the ProbeMaster FISH image 
analysis system (Perceptive Scientific Instruments, Houston, 

15 Texas) . Color prints were produced using a Kodak XL 7700 
monochrome continuous printer. 

2 . 3 DNA preparations . 

Extraction of genomic DNA and the Southern analyses 

20 were performed as described previously (Schoenmakers et al., 
1994) . DNA from YACs, cosmids, PCR products and oligonucleo- 
tides was labelled using a variety of techniques. For FISH, 
cosmid clones or inter-Alu PCR products of YACs were bioti- 
nylated with biotin-ll-dUTP (Boehringer) by nick translati- 

25 on. YAC DNA (100 ng) was amplified by inter Alu PCR (PI: 

CTGCACTCCAGCCTGGG , P2 : TCCCAAAGTGCTGGGATTACAG ) . After initi- 
al denaturation for 5 min at 94 °C, 30 amplification cycles 
were performed each consisting of denaturation for 1 min at 
94 °C, annealing for 30 sec at 37 °C and extension for 6 min 

30 at 72 °C, and with a final extension at 72 °C for 10 min. 
Amplified DNA was purified with QIAQuick PCR Purification 
kit (Qiagen) . For filter hybridizations, probes were 
radio-labelled with alpha- 32 P-dCTP using random hexamers 
(Feinberg and Vogelstein, 1984). In case of PCR-products 

35 smaller than 200 bp in size, a similar protocol was applied, 
but specific oligonucleotides were used to prime labelling 
reactions. Oligonucleotides were labelled using 
gamma - 32 P- ATP . 
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2.4 YAC library screening. 

YAC clones in this paper were isolated from the CEPH 
mark 1 (Albertsen et ai., 1990) and CEPH mark 3 (Chumakov et 
al., 1992) YAC library, made available to us by the Centre 
5 d' Etude du Polymorphisme Humain (CEPH) . YACs were isolated 
using a combination of PCR-based screening and colony hy- 
bridization analysis (Green and Olson, 1990) . Contaminating 
Candida parapsylosis, which was sometimes encountered, was 
eradicated by adding terbinafin to the growth medium (final 
10 concentration of 25 microgram/ml) . The isolated YAC clones 
were characterized by STS-content mapping, contour-clamped 
homogeneous electric field (CHEF) electrophoresis (Chu et 
al., 1986), restriction mapping and hybridization and FISH 
analysis . 

15 

2.5 Cosmid library screening. 

Cosmid clones were isolated from an arrayed human 
chromosome 8-specific cosmid library (Wood et al., 1992) 
obtained from Los Alamos National Laboratory (LANL) . 
20 LANL-derived cosmid clones are indicated by their microtiter 
plate addresses. Cosmid DNA was extracted using standard 
techniques involving purification over Qiagen tips (Qiagen) . 

2.6 Pulsed-field gel electrophoresis and Southern blot 
25 analysis. 

Pulsed-field gel electrophoresis and Southern blot 
analysis were performed exactly as described by Schoenmakers 
et al. (1994). Agarose plugs containing high -molecular 
weight YAC DNA (equivalent to about 1 x 108 yeast cells) 

30 were twice equilibrated in approximately 25 ml TE buffer (pH 
8.0) for 30 min at 50 °C followed by two similar rounds of 
equilibration at room temperature. Plugs were subsequently 
transferred to round-bottom 2 ml eppendorf tubes and equili- 
brated two times for 30 min in 500 microliter of the approp- 

35 riate 1 x restriction-buffer at the appropriate restriction 
temperature . 

Thereafter, DNA was digested in the plugs according to 
the suppliers (Boehringer) instructions for 4 h using 30 
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units of restriction endonuclease per digestion reaction. 
After digestion, plugs along with appropriate molecular 
weight markers were loaded onto a 1% agarose / 0.25 x TBE 
gel, sealed with LMP-agarose and size fractionated on a CHEF 
5 apparatus (Biorad) for 18 h at 6.0 V/cm using a pulse angle 
of 120 degrees and constant pulse times varying from 10 sec 
(separation up to 300 kbp) to 20 sec (separation up to 500 
kbp) . In the case of large restriction fragments, additional 
runs were performed, aiming at the separation of fragments 

10 with sizes above 500 kbp. 

Electrophoresis was performed at 14 °C in 0.25 x TBE. As 
molecular weight markers, lambda ladders (Promega) and 
home-made plugs containing lambda DNA cut with restriction 
endonuclease Hindlll were used. After electrophoresis, gels 

15 were stained with ethidium bromide, photographed, and UV 
irradiated using a stratalinker (Stratagene) set at 12 0 mJ. 
DNA was subsequently blotted onto Hybond N+ membranes 
(Amersham) for 4-16 h using 0.4 M NaOH as transfer buffer. 
After blotting, the membranes were dried for 15 min at 80 °C, 

20 briefly neutralised in 2 x SSPE, and prehybridised for at 
least 3 h at 42 °C in 50 ml of a solution consisting of 50% 
formamide, 5 x SSPE, 5 x Denhardts, 0.1% SDS and 200 
microgram/ml heparin. Filters were subsequently hybridised 
for 16 h at 42 °C in 10 ml of a solution consisting of 50% 

25 formamide, 5 x SSPE, 1 x Denhardts, 0.1% SDS, 100 

microgram/ml heparin, 0.5% dextran sulphate and 2-3 x 106 
cpm/ml of labelled probe. Thereafter, membranes were first 
washed two times for 5 min in 2 x SSPE/0.1% SDS at room 
temperature, then for 30 min in 2 x SSPE/0.1% SDS at 42 °C 

30 and, finally, in 0.1 x SSPE/0.1% SDS for 20 min at 65°C 
Kodak XAR-5 films were exposed at -80 °C for 3-16 h, 
depending on probe performance. Intensifying screens (Kyokko 
special 500) were used. 

Agarose plugs containing high-molecular weight yeast + 

35 YAC DNA (equivalent to 1 x 10 9 cells ml" 1 ) were prepared as 
described before (Schoenmakers et al., 1994). Plugs were 
thoroughly dialysed against four changes of 25 ml T10E1 (pH 
8.0) followed by two changes of 0.5 ml 1 x restriction 
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buffer before they were subjected to either pulsed-field 
restriction enzyme mapping or YAC-end rescue. 

2.7 PCR analysis. 
5 PCR amplification was carried out using a Pharmacia 

LKB-Gene ATAQ Controller (Pharmacia/LKB) or a Perkin Elmer 
9600 (Perkin-Elmer Cetus) in final volumes of respectively 
50 and 25 microliter containing 10 mM Tris-HCl pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01% gelatine, 2 mM dNTPs, 20 pmole of 

10 each amplimer, 1.25 units of AmpliTaq (Perkin-Elmer Cetus), 
and 100 ng (for superpools) or 20 ng (for pools) of DNA. 
After initial denaturation for 5 min at 94 °C, 30 amplifica- 
tion cycles were performed each consisting of denaturation 
for 1 min at 94 °C, annealing for 1 min at the appropriate 

15 temperature (see Table 3), and extension for 1 min at 72 °C, 
and with a final extension at 72 °c for 10 min. Results were 
evaluated by analysis of 10 microliter of the reaction 
product on 2% agarose gels. 

20 2.8 Generation of STSs from YAC inserts and insert ends. 

YAC-end rescue was performed using a vectorette-PCR 
procedure in combination with direct solid phase DNA sequen- 
cing, as described before (Geurts et al., 1994). STSs speci- 
fic for YAC inserts were generated from inter-Alu PCR pro- 

25 ducts, isolated using published oligonucleotides TC65 or 517 
(Nelson et al., 1989) to which Sail-tails were added to 
facilitate cloning. Figure 10 shows some of these STSs. 
After sequence analysis, primer pairs were developed using 
the OLIGO computer algorithm (Rychlik, 1989) . They were 

30 tested on human genomic DNA, basically according to 
procedures described above. 

2.9 Nucleotide sequence analysis and oligonucleotides. 

Nucleotide sequences were determined according to the 
3 5 dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/LKB or the dsDNA Cycle Sequen- 
cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
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standard 30 cm, 6% Hydrolink*, Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI; Altschul et al. , 1990). All 
oligonucleotides were purchased from Pharmacia Biotech. 



3 . Results 

3.1 Isolation of YAC clones to start chromosome walking. 

According to radiation hybrid mapping data, band 8ql2 
is flanked by markers D8S165, D8S285 proximally, and by 
10 D8S260 distally (Sapru et al. , 1994). The estimated genetic 
distance between these markers is 9 cM (Gyapay et al. , 
1994) . Using primer sets for these microsatellite markers, 
we selected sets of corresponding YAC clones. These YACs 
were used for initiating the walk towards the 8ql2 translo- 
15 cation breakpoints, meanwhile generating a long-range contig 
based upon STS-content mapping of these. Hence we started to 
construct a 8ql2 centromeric YAC contig (from D8S165 / 
D8S285) and a 8ql2 telomeric YAC contig (from D8S260) . By 
fluorescent in situ hybridisation (FISH) experiments, we 
2 0 could soon show that all but two of the translocation 

breakpoints in primary adenomas mapped within a 1 cM inter- 
val within the centromeric YAC contig (Roijer et al., 1996). 
Hence, the YAC contig of the telomeric side of 8ql2 was not 
characterized in that much detail. The two YAC contigs could 
25 not be linked to each other. 

3.2 Assembly of a YAC contig starting from D8S285 

In previous studies (Roijer et al. , 1996) we have 
described the isolation of two overlapping YACs containing 

30 D8S285, namely 935E9 and 166F4. These YACs, together with a 
MOS containing YAC (900gl0157) , were used as a starting 
point for a chromosome walking project to define the 8ql2 
centromeric region, as mentioned in the introduction. Initi- 
ally, chromosome walking was performed bidirectionally . This 

35 allowed us to hook up our 8ql2 centromeric contig to a 
chromosome 8qll YAC contig. Both contigs have the marker 
D8S593 in common. In the subsequent PCR based screening of 
the human CEPH YAC libraries with D8S108 and D8S166, we 
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isolated MegaYAC 946B7, which together with MegaYAC 935E9, 
formed the backbone of our centromeric contig. 

In the bidirectional and subsequent unidirectional 
chromosome walking steps, the following general procedures 
5 were used. First, rescuing and sequencing the ends of YAC 
clones resulted in DNA markers characterizing their left and 
right sides (Table 2). Based on sequence data on the ends of 
the YAC insert, as well as on inter-Alu PCR products, primer 
sets were developed for specific amplification of DNA, 

10 establishing STSs (Table 3). Their location to 8ql2-qter was 
determined by CASH as well as by FISH after corresponding 
YAC and/ or cosmid clones were isolated. With the sequenti- 
ally selected and evaluated primer sets, screening of the 
YAC and cosmid libraries was performed to isolate the buil- 

15 ding blocks for contig assembly. Hence, we established a 

8ql2 centromeric YAC contig consisting of 34 overlapping YAC 
clones, covering approximately 2 Mb of DNA (Fig. 1) . The 3 4 
YACs are between 160 kb and 1660 kb long. This YAC contig 
appeared to encompass the chromosome 8ql2 breakpoints of all 

20 but two primary adenomas studied. YAC 166F4 for instance, 
was already shown to cross the translocation breakpoint in 
two adenomas with t(3;8) (p21;ql2) (Roijer et al., 1996). 
Characteristics of the YACs that were used to build this 
contig are given in Table 2. 

25 The YAC contig was constructed in parallel with the 

screening data of Whitehead Institute / MIT Center for 
Genome Research available via http://www-genome.wi.mit.edu- 
/cgi-bin/contig/lookup_contig (Contigs WC8.7 and WC-157) . 

3 0 3.3 Physical mapping of polymorphic markers, ESTs and genes 
All markers and genes which were or became publicly 
available during our work were STS content mapped on our 
contig and those found positive were subsequently sublocali- 
zed by (primer) hybridization on YAC Southern blots. The 

3 5 markers that were found to reside within the centromeric 
contig presented here were D8S1069 (WI-536) , AFMB055WG9, 
D8S125, D8S108, D8S1661 (WI-5459), D8S166, D8S96, D8S1816, 
D8S285, D8S1516 (WI-3862), D8S1828 and D8S165. Furthermore, 
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besides the MOS proto-oncogene (Testa et al., 1988), two 
other genes could be positioned in our centromeric YAC 
contig, namely the YES related proto-oncogene LYN (Corey and 
Shapiro, 1994) and the preproenkephalin gene PENK (Tomfohrde 
5 et al., 1992). Finally, our contig contains 2 expressed 
sequence tags, namely WI-7754 (D8S1930) , corresponding to 
LYN, and IB2045, not resembling any currently known gene. 
The genes encoding the kappa opioid receptor (OPRK1) , 
(Yasuda et al. , 1994) and interleukin-7 (IL7) (Brunton and 
10 Lupton, 1990) are not contained in our YAC contig. 

3.4 Restriction map and putative CpG islands: Construction 
of a rare-cutter physical map from the 2 Mb YAC contig 
in the 8ql2 centromeric region 

15 Southern blots of total yeast plus YAC DNA, digested to 

completion with rare-cutter enzymes BssHII, Kpnl, Mlul, 
NotI, Pvul, Sail and Sfil (see Materials and Methods) and 
separated on CHEF gels, were hybridized sequentially with 
(i) the STS used for the initial screening of the YAC in 

20 question, (ii) pYAC4 right arm sequences, (iii) pYAC4 left 
arm sequences, and (iv) a human Alu-repeat probe (BLUR-8) . 
The long-range restriction map that was obtained in this way 
was completed by probing with PCR-isolated STSs/YAC end 
probes. Restriction maps of individual YAC clones were 

25 aligned, and a consensus restriction map was established. 
The region was searched for CpG islands on the basis of the 
colocalization of sites for these rare-cutter enzymes (Fig. 
1) • 

30 3.5 FISH mapping of 8ql2 breakpoints in pleomorphic 
adenomas 

In previous studies we have mapped the t (3 ;8) (p21;ql2) 
breakpoint within a 1 Mb region flanked by MOS proximally 
and by the genetic marker D8S166 distally. One YAC within 
35 this region (166F4) was shown to span the t(3;8) breakpoint 
in two primary pleomorphic adenomas (Roijer et al. (1996) ) . 
We have now extended these previous observations by analysis 
of additional tumors including cases with different variant 
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translocations (Table 1) . Six of eight primary adenomas with 
8ql2 abnormalities had breakpoints mapping within YAC 166F4 
(Fig. 2A) . Also rumors with other 8ql2 rearrangements than 
the classical t(3;8) had breakpoints within the same YAC, 
5 demonstrating that this is the major breakpoint cluster 
region in pleomorphic adenomas with 8ql2 abnormalities. 

To fine map this region we isolated cosmids from an 
arrayed human chromosome 8-specific cosmid library (obtained 
from Los Alamos National Laboratory; Wood et al. (1992)) 
10 using markers contained within YAC 166F4. Cosmid clones 

containing MOS, EM156 and CH129 were isolated and mapped by 
FISH (Fig. 1) . The breakpoints in all six cases were 
localized between cosmids CEM1 (MOS) en CEM23 (EM156) 
(Fig. 2B and C) . In for example CG 666, CEM1 was transposed 
15 to 8p due to a pericentric inversion of the dicentric marker 
chromosome (Fig. 2B) , while CEM23 remained in its normal 
position at 8ql2 (Fig. 2C) . In adenoma CG 682, which has an 
ins(8;3) (ql2;p21.3pl4.1) , CEM23 spanned the insertion 
breakpoint (Roijer et al. (1997)). 
20 Two of the eight adenomas tested, CG 650 and CG 787, 

had breakpoints clearly proximal to MOS. To further map 
these breakpoints we isolated two additional sets of YACs 
corresponding to CH37, D8S593 and the recently identified 
XRCC7 gene (Blunt et al. (1995)), respectively. In CG 650 
25 the breakpoint was found to reside between ICRF YAC 
900gl0157 and YAC 898G12 (Fig. 1) , and in CG 787 the 
breakpoint was found to reside within the XRCC7 containing 
YAC 943G4 (Fig. 2D). In the latter case, which judged from 
cytogenetics had. an ins(8;7) (qll-12 ; P 21pl4) , FISH analysis 
30 revealed signals on both the der(7) and the der(8) 
chromosomes, demonstrating that this is not a simple 
8:7-insertion but a complex rearrangement with at least two 
breakpoints on 8q, one of which maps within YAC 943G4 and 
the other proximal to this YAC. 
35 3.6 Assembly of a YAC contig between D8S260 and D8S507 
Two MegaYACs corresponding to D8S2 60, were used to 
start the chromosome walk from the telomeric side of chromo- 
some 8ql2.The YAC insert-ends from these two clones (783H12 
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and 814A6) were isolated and used for STS content mapping. 
The most telomeric STS (CH31) , is present in two YACs also 
containing D8S510, and hence links the 8ql2 YAC contig to a 
8ql3 specific YAC contig (Koenig et al., 1995). When we 
5 started our physical mapping effort, only one polymorphic 
marker, D8S507, was described in between D8S285 and D8S260 
(Gyapay et al., 1994). Initially, we were unable to isolate 
YACs containing D8S507. In stead we used the corresponding 
amplifiers to isolate a cosmid containing D8S507 (cosmid 

10 CEM3). An insert-ends of this cosmid clone (EM73) was then 
used to isolate corresponding MegaYACs (872E1, 882D9 and 
957C4) . It is unclear why these YACs were initially not 
recognised upon the YAC library screenings. This might have 
been due to either a less efficient PCR of D8S507 than of 

15 EM73 , or to an underrepresentation of the D8S507 YAC-contai- 
ning yeast cells in the primary pools of the specific 
library we screened. The availability of two new polymorphic 
markers, D8S1505 and D8S1515 (WI-6879) allowed to link the 
two separate MegaYAC contigs around D8S2 60 and D8S507 . The 

20 composite contig consists of 23 CEPH MegaYACs and covers 
approximately 5 Mb of genomic DNA (Fig. 3) . The polymorphic 
markers that were tested and found to reside in the telome- 
ric YAC contig are D8S1151 (WI-1155) , D8S260, D8S1075 
(WI-943), D8S1505, D8S1515 (WI-6879), D8S1723, D8S507 and 

25 D8S1957 (WI-9507). 

One gene, CYP7, encoding cholesterol 7a-Hydroxylase 
(and previously assigned to 8qll-12 using both mouse-human 
somatic cell hybrids and FISH (Cohen et al., 1992)) was 
mapped in this contig using a primer set that we developed on 

3 0 publicly available sequence data (see Table 3) . 

The YAC contig was constructed in parallel with the scree- 
ning data of Whitehead Institute / MIT Center for Genome 
Research available via http://www-genome.wi.mit.edu/ 
cgi-bin/ contig/ lookup_contig (Contigs WC8.8 and WC-7 27) . 

35 

±j_ Discussion 

In this example the physical map of 2 Mb of the 
centromeric region of chromosome 8ql2 composed of 34 YAC 
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clones is presented. It is tagged by 31 markers with an 
average interval of 65 kb and was validated by FISH mapping. 
The frequency of chimaerism in the set of YACs studied in 
detail (YAC end rescue and/or restriction enzyme mapping 
5 and/or FISH) was slightly lower (6/26 = 23%) than the 30-40% 
expected (Van Ornrnen, 1993) . These data provide the most 
detailed available physical map of the 8ql2 region. The map 
was constructed in parallel with the establishment of a 
low-resolution map of the human genome and with the 
10 screening data of the Whitehead Institute/MIT Center for 
Genome Research. It was compared with data available 
electronically via Internet. The order of the markers on the 
physical map is in agreement with the 1996 Genethon human 
genetic linkage map (Dib et al. (1996)). In total, 
15 31 markers/genes were ordered on the contig as follows; 
DSS1069-CH274-TC65.2-AFMB055WG9-CH277-CH34-D8S125 — 
D8S108-EM76-D8S1661-CH122-D8S96-D8S166-CH69-CH3 3-D8S1816-PE- 
NK-CH280-CH129-EM156-END2-CH283-D8S285-MOS-D8S1516-LYN-D8S1- 
828-IB2045-D8S165-CH37-W1086 . 
20 A restriction map with the enzymes BssHII, Kpnl, Mlul, 

NotI, Sail, Sfil, Swal was generated, revealing at least 7 
putative CpG islands. Three of these island correspond to 
previously known genes (PENK, MOS and LYN) , while the others 
might represent the 5* end of unidentified genes so far. Our 
25 2 Mb map integrates cytogenetic, genetic and physical data 
of chromosome band 8ql2 . Chromosome band 8gl2 seems to be 
very gene-poor: so far only 4 genes were shown to reside in 
both contigs (CYP7 , PENK, MOS and LYN). This is consistent 
with the view that Giemsa-positive bands are chromosome 
3 0 regions rather low in G+C content and poor in genes (Craig 
and Bickmore, 1993) . Chromosome 8ql2 is partially conserved 
in synteny with mouse chromosome 4 (Mock et al., 1996). 
Comparative gene mapping however did not reveal any mouse 
gene or marker not present on our physical map. 
35 Our physical map contains at least one disease locus, 

i.e. the pleomorphic adenoma translocation breakpoint. FISH 
analyses using YACs and cosmids revealed that the majority 
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of breakpoints clustered within a 300 kb region in both 
adenomas with the recurrent t(3;8) and in those with 
different sporadic 8ql2 translocations. A cosmid from this 
region was also shown to span the breakpoint in an adenoma 
5 with an ins(8;3) (Roijer et al. (1997)). This region is 

therefore likely to harbor the pleomorphic adenoma gene. The 
breakpoint region contains one known gene ( MPS ) , one 
polymorphic marker (D8S285) , and three new STSs (EM156, END 2 
and CH283). BLAST searches of these STSs revealed that CH283 
10 displayed sequence identity with a publicly available EST 
(expressed sequence tag) . 

Two of the eight tumors in the present series had 
breakpoints outside the 300 kb breakpoint region. By FISH we 
could demonstrate that the breakpoints in both cases were 
15 clearly proximal to MPS . In one case the breakpoint was 

mapped between MPS and YAC 898G12, and in the other case it 
was found to reside within YAC 943G4. The latter YAC maps at 
least 2 Mb centromeric to MPS . Since the two breakpoints are 
clearly different and also separate from those located in 
20 the 300 kb breakpoint region it is unlikely that they affect 
the same gene. However, we can of course not rule out the 
possibility that these cases have additional rearrangements 
that may have escaped detection with the DNA probes used, 
and that they in fact all have rearrangements of a common 
25 gene in the breakpoint region. 

Recently, we have obtained molecular evidence for an 
additional breakpoint region distal to our telomeric contig 
(Roijer et al. (in manuscript)). Based on FISH using YACs 
derived from 8ql3-21 we have estimated that these 
3 0 breakpoints are about 15 to 20 cM telomeric to the 8ql2 
breakpoint cluster region. Cytogenetically , these 
breakpoints are very difficult, or impossible to distinguish 
from the classical 8ql2 breakpoints. This is also true for 
the breakpoints located centromeric to MPS . Collectively, 
35 our findings thus suggest that there are several genes in 
the proximal part of 8q that are affected by chromosome 
rearrangements in pleomorphic adenomas. 
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In addition to pleomorphic adenomas there are also a 
few other types of solid tumor with structural 
rearrangements involving 8qll-13, namely lipoblastomas (Dal 
Cin et al. (1994); Sawyer et al. (1994)), rhabdomyosarcomas 
5 (Mitelman (1994)) and renal cell carcinomas (Elfving 
(1996)). It should also be mentioned that pleomorphic 
adenomas of the lacrimal glands show recurrent 
rearrangements of 8ql2 (Hrynchak et al. (1994)), including a 
t (3 ;8) (p21;ql2) . Because of the well known histopathological 

10 and clinical similarities between pleomorphic adenomas 
originating from the salivary and lacrimal glands it is 
likely that the 8ql2 abnormalities in these tumor types 
affect the same gene. As for the other tumor types mentioned 
it remains to be shown whether the rearrangements in these 

15 cases also affect this gene. Preliminary FISH analysis of 
lipoblastoma with an 8ql2 translocation breakpoint have 
shown that the breakpoint, as in pleomorphic adenomas, is 
distal to MPS (unpublished observations) . Unfortunately no 
markers telomeric to MPS could be tested. FISH studies are 

20 now in progress to resolve these critical questions. 

Finally, a second YAC contig was established between 
D8S260 and D8S507 , consisting of 23 CEPH MegaYACs and cove- 
ring approximately 5 Mb. Both contigs represent about 75% of 
human chromosomal band 8ql2, which extends over approximate- 

25 ly 9 Mb of DNA. They will provide molecular access to new, 
previously unidentified genes from this region and especial- 
ly to those directly affected by the chromosome 8ql2 aberra- 
tions in pleomorphic adenomas of the salivary gland. 

3 0 6_j_ Figure legends 

Figure 1. 

Physical contig and STS marker map in proximal human 
8ql2. The upper line indicates the chromosome, with the 
35 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones are represented as lines together 
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with their names and are described in Table 2. STS locations 
are indicated by dashed lines. The length of the solid line 
indicates its approximative size. The size of YAC 3 91H2 and 
392A6 is too large to fit within the constraints of the map, 
5 suggesting chimaerism. In these cases, the lenght of the YAC 
corresponds only to the markers it contains. YAC end clones 
used as STS are represented by arrows. Below the contig the 
consensus restriction maps for BssHII (B) , Kpnl (K) , Mlul 
(M) , NotI (N) , Pvul (P) , Sail (S) and Sf il (Sf ) are indica- 
10 ted. Verical arrows indicate putative CpG islands, defined 
as the colocalization of sites for (a) K, M, N; (b) K, M, P, 
S, Sf; (c) K, M, P, Sf; (d) K, N, S; (e) K, N, S; (f) B, N, 
S; (g) B, K, N, Sf. 

The pleomorphic adenoma gene region is indicated by a gray 
15 shaded box. 

Figure 2 . 

Physical contig and STS marker map in distal human 
8ql2 . The upper line indicates the chromosome, with the 

20 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones and cosmid CEM3 are represented as 
lines together with their names. YAC end clones used as STS 

25 are represented by arrows. 

Figure 10 . 

STSs used to generate the 3 00 kb cosmid contig mapping 
at chromosome 8ql2 and encompassing PLAGl . 
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TABLE 2: Analysis of YAC clones in the centromeric contig 



CEPH code 


Size fkb) 


Landm. left Acc. No. 


Landm. right Acc. No. 


Chimeric 


518G7 




CH131 




Y (left) 


518G12 




CH132 




ND 


770B2 


1410 


CH281 


CH286 


ND 


878F2 


800 


CH282 


CH287 


ND 


255H10 


390 


CH273 


CH272 


ND 


531D4 


430 


CH274 


CH277 


ND 


15H11 


430 






ND 


29 lip 12 


410 


CH29 


CH34 


Y (left) 


47E9 


370 






ND 


26F6 


460 






ND 


916F12 


1230 


CH122 


CH128 


ND 


388D6 


430 


CH69 


CHI 00 


ND 


297E11 


350 


CH280 


CH285 


ND 


253H7 


160 


CH279 


CH284 


ND 


143D5 


260 


CH278 


CH283 


ND 


391H2 


620 






Y 


392A6 


600 






Y 


84E12 


500 


CH289 


CH290 


ND 


166F4 


700 


CH67 


CH73 


ND 


164H5 


375 


END2 


END6 


ND 


206F11 




END3 


END7 


Y (right) 


253C7 


620 


END4 


END8 


ND 


295A4 




CH68 


CH74 


Y (left) 


917E11 


1380 


END1 


END5 


ND 


946B7 


1370 


CH123 


CH129 


ND 


935E9 


1660 


CH33 


CH37 


ND 



Note. YAC clones were isolated from CEPH YAC libraries as described under Materials and 
Methods. ND, not detected by methods used (FISH, YAC end rescue, restriction enzyme 
mapping). GenBank accession numbers are given. 
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TABLE 3: 8ql2 STS primer sequences, annealing temperatures and 
expected PCR product sizes 

Amplimers on published sequences centromeric contig 

STS name Nucleotide sequence 5' - 3' Product T a (°C) 

size (bp) 



GGAAGGAAAGGAAAGGAGA 
GGTTTGGGTGTTTGGTGTG 



GAGCTCAACGTAGCAAGGCT 
AACTGTCCTCCAGTGCGG 



TAATAAAGGAGCCAGCTATG 
ACATCTGATGTAAATGCAAGT 



TCTCTACCTCGACATACTCCTGC 
GTCAGAAGTGCCAATCAGACTG 



CAAACCTTGAATTACAAAACAG 
TGTTAATATTAGACCACCTTTC 



TTCTGTTTCCTGGTCT CA GTA G C 
CACATGCATATGTGTATGTGTGTG 



ACA AGAGCA CATTTAGTCAG 
AGCTTCATTTTTCCCTCTAG 



GATTGTGTCATTGCACTCCA 
ACAAGGAAGTTCCTTTTTGG 



GCATC ACACA GAATCTTTG 
ATGGGTTT ATGG CCTTT A C 



D8S1069 



AGCACAGTGGATATTTTTAGGC 
GGGGCTCACACAGAAGTTAA 
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GTCCCCCATCAACATGCTG 
CrCATCTTGTTTTCATAGTGTTCC 



TCTCATGCATGTTTCCTGTTG 
GTTGGGGTCATTAAACACTAGTCA 



TGCACCCTTAAAAAGCATCG 
ACTTGCGA ACATGGG ATC A C 



AGTGCTGTTTTTACTTCTGTACG 
GCAAGACTCTGTCTCAGGA 



A|FMB055WG9 



CTCCCAACCCA CCCGA C 
TGAAAACCATAATCTCTGATGTTGC 



Amplimers on newly isolated STSs centrom eric contig 
STS name Nucleotide sequence 5' - 3* 



Product T a CQ 
size (bp) 



CCTTTGGCTGGGGTTTATA 
GGCCTATGAAGCAAGAGAG 



GTACCCAGAAGGCAAGTAA 
GTGAAAAGGCAGAAATTAG 



TTGCATGAGAATGGAAATG 
GGCGTTACTTTCCTTTTGT 



AGTGCTTACAATAGGGTGAG 
CCATCCAGAAAGACCATAAT 



TTTGTCTTTGATTTTTATGG 
TGACCAACATACTGCCTAGT 



CTGAATCCCAGAACAATATA 
AGGGTAAGTATGTCCTTTAA 
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CH273 



ATAATGTTGAGACTTTGAGA 
A AATGTTT ATCCTA ATT GT A 



CAGGTGAGTGGATGGTGTAA 
CAAGGGGAGACCAAATCATC 



241 



CH277 



AATGGCTATGAGGTTGTTTT 
CACATCCTTTCATTTTAGCA 



GGGCTGATGTTCCATTAACT 
GCTTCAACACCAAAAATGCT 



CTGGGAAGAGATCAAAATTC 
TAAAGAGACAGCACCACAAA 



AGTAGCAGCAGCAACAGTCA 
TGCGCTATTCAGAGAAGATG 



CAGTCAGTTCCAGAGGTCATTT 
T A GGGAGGGCTTT A ATA GTGTT 



GCTCACTTCACTCCTACCC 
CAACCAACCACTAAAAACG 



GTGATTTTA C A GCC ATTTT 
TGT A ATTTTCA A CCA G A A G 



TACAAACCGGGAGAAAACAG 
TTACAGCATTTCCGATTTTG 



PCTYEP97/047S9 

31 



Amplimers on published sequences centromeric contig 

STS name Nucleotide sequence 5' - 3' Product T a (°Q 

size (bp) 



TTGACTTTTAAATATGATAGGTAT 1600 
ACTTTTATTTCTGAAAGATGAATCA 

AGGCTTGCCAGATAAGGTTG 187-2. 
GCTGAAGGCTGTTCTATGGA 

TTCCTCAGAGCAGTTCAAAG 155-1' 
TAATCTTGCCCCAGTGAGAG 

ATGCAAAGATGAACCAGGAA 216 
CCCTGGACTCATGGTACTTG 

GGATTTTAAGTTTCTACAAAGGGA 328 
ACAATTTCCATGAAGTGTTCA CC 

AGCTCAATGGCACGTCCTTT 235-; 
ACTGCTGACTCAGAGCCTGG 



Amplimers on newly isolated STSs telomeric contlg 
STS name Nucleotide sequence 5' - 3' 



Product 
size (bp) 



T a CC) 



CH31 TCACAGAATAATACAGGAT 220 

GATCACTGATGATACTAGG 

CH32 ATTTGCCTCAGTGTTGCAG 245 

ATTTTCAGGA GGTCA GGG A 



CH35 



CAAAATGACTTATGCTGAA 
TCTATACAGGGCATTGTGA 



165 60 
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EM73 AAAGCAAGACCCTGTAAAGC 351 60 

CTTGGGCTCTATTTTGTGAA 

EXAMPLE 2 

5 1. Introduction 

In this example the identification of a member of the 
PLAG gene family that appears to be of pathogenetical rele- 
vance is described. In previous experiments, it was found 
that the segment between MPS and PENK on human chromosome 
10 8ql2 harbors the recurrent chromosome t(3;8) (p21;ql2) 

breakpoint frequently found in pleomorphic adenomas of the 
salivary glands. Using a positional cloning approach, the 
PIAG1 gene was identified and its genomic organization 
characterized . 

15 The largest cytogenetic subgroup of pleomorphic adenoma 

of the salivary glands carries chromosome 8ql2 aberrations 
with 3p21 as preferential translocation partner. Here we 
demonstrate that the t(3;8) (p21;ql2) results in promoter 
swapping between PLAG1, a novel, development ally regulated 

20 zinc finger gene on 8ql2, and the constitutively expressed 
gene for B-catenin (CTNNB1) , a protein interface functioning 
in adherens junctions and the WG/WNT signalling pathway. 
Fusions occur in the 5 1 -non-coding parts of both genes, 
exchanging regulatory control elements while preserving the 

25 coding regions. Due to the t (3 ;8) (p21;ql2) , PLAG1 is activa- 
ted and expression of CTNNB1 down-regulated. Activation of 
PIAG1 was also observed in an adenoma with a variant trans- 
location t(8;15). The results indicate that PLAG1 activation 
due to promoter swapping is a crucial event in salivary 

3 0 gland tumorigenesis . 

Recently, molecular insight was obtained into the 
genetic basis of benign tumors by the discovery of a common 
genetic denominator in tumors of different types but with a 
particular cytogenetic profile ( Schoenmakers et al. (1995); 

35 Ashar et al. (1995)). Molecular analysis of recurrent 

chromosome 12ql3-15 aberrations in pleomorphic adenoma of 
the salivary glands, lipoma, uterine leiomyoma, hamartoma of 
lung and breast, fibroadenoma of the breast, angiomyxoma, 
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and endometrial polyps revealed the consistent involvement 
of the high mobility group protein gene HMGIC ; the first 
"benign oncogene" . 

Evidence is mounting that the same is likely to hold 
5 true for the closely related family member HMGIY, which is 
located on the short arm of chromosome 6. Both genes belong 
to the high mobility group (HMG) protein gene family (Bustin 
et al. (1990). Their corresponding proteins possess three 
amino-terminal AT hooks, through which the proteins are 

10 assumed to bind to A/T-rich DNA sequences, and an acidic 
tail in the carboxy-terminal region. Functionally, they act 
as architectural factors in the nuclear scaffold, critical 
for the correct assembly of stereospecif ic transcriptional 
complexes (Wolffe (1994)). 

15 The genetic aberrations frequently result in the 

separation of the three AT hooks from the acidic tail, 
although all the pivotal changes in the high mobility group 
protein genes that can lead to aberrant growth control are 
not yet known. It should be noted that a variety of chromo- 

20 some segments can act as translocation partner of 12gl3-15 
but each tumor type seems to have a preferential transloca- 
tion partner. At present, only the preferential t(3;12)(q27- 
28;ql5) translocation of ordinary lipoma has been 
characterized (Petit et al. (1996)). The gene affected on 

25 chromosome 3 is the LPP gene, which encodes a protein 

containing three LIM domains. LIM domains have been found in 
a growing number of proteins in mammals, amphibians, flies, 
worms, and plants and act as modular protein-binding 
interfaces (Schmeichel & Beckerle (1994)). LIM proteins 

3 0 mainly have a function in cell signalling and developmental 
regulation and include, for instance, transcription 
regulators, proto-oncogene products, and adhesion plaque 
constituents. The preferential t(3;12) in lipoma results in 
the formation of an HMGIC /LPP fusion transcript encoding a 

35 hybrid protein consisting of the three DNA binding domains 
of HMGI-C and the LIM domains of the protein encoded by LPP . 

In an approach to systematically unravel the molecular 
pathogenesis of benign tumors, the inventors have focused on 
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other cytogenetic subgroups of these, since their mere 
existence indicate the presence of additional critical 
genes. It is of interest to establish whether such genes are 
functionally similar or different as compared to the high 
5 mobility group protein genes HMGIC and HMGIY. In this con- 
text, the inventors have now molecular ly characterized the 
largest cytogenetic subgroup of pleomorphic adenomas. This 
type of neoplasm is an epithelial tumor occurring primarily 
in the major and minor salivary glands. It is by far the 
10 most common type of salivary gland tumor, accounting for 
almost 50 % of all tumors in these organs (Seifert et al. 
(1990)). Pleomorphic adenomas are almost exclusively benign 
tumors, which only rarely undergo malignant transformation. 
They show a marked histological diversity with epithelial 
15 and myoepithelial cells arranged in a variety of patterns in 
a matrix of mucoid, myxoid, chondroid and, on rare 
occasions, even osteoid tissues. Cytogenetically , 
pleomorphic adenomas are characterized by recurrent 
rearrangements, in particular reciprocal translocations with 
20 consistent breakpoints at 3p21, 8gl2 and 12ql3-15 (Mitelman 
(1994)). In addition to the cases with abnormal karyotypes, 
there is also a subgroup of tumors with apparently normal 
karyotypes (30 to 50% of the cases) . Abnormalities of 8ql2 
are most common and are found in about 60% of the cases with 
25 abnormal stemlines, whereas rearrangements of 3p21 and 
12ql3-15 are found in about 30% and 20% of the cases, 
respectively. The most frequent and characteristic 
abnormality so far observed in pleomorphic adenomas is a 
reciprocal t(3;8) (p21;ql2) translocation (Mark et al. 
30 (1980)). While chromosome 3p is the preferential 

translocation partner, most human chromosomes have been 
found as translocation partner of 8ql2 . Rearrangements of 
8ql2 are often found as the sole anomalies, indicating that 
they represent primary cytogenetic events of possible 
35 pathogenetic importance. 

Here, the inventors describe the positional cloning of 
the 8ql2 translocation breakpoint as well as the identifica- 
tion and characterization of the genes on chromosome 8ql2 
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and 3p21 disrupted by the t(3;8) translocation. The gene on 
chromosome 8ql2 is a novel zinc finger gene, which we have 
designated PLAG1 . whereas the gene on 3p21 is CTNNB1 which 
is the translocation partner, the gene for B-catenin, a 
5 protein with an established role in cell adhesion and signal 
transduction (Peifer (1993)). By Southern blot and RACE 
analysis, we pinpoint the breakpoints within the introns of 
the 5*-noncoding regions of both genes and show that well- 
defined fusion transcripts are expressed in the tumor cells. 

10 We also demonstrate that the t(3;8) translocation results in 
specific disruption of the transcriptional control of the 
two genes, leading to activation of PLAG1 and down- 
regulation of CTNNBl . A pleomorphic adenoma with 8ql2 but 
not 3p21 involvement has also been studied to evaluate the 

15 phenomenon of disruption of transcriptional control in a 
broader perspective. 

2^. Materials and methods 

2 . 1 Tumor material and chromosome analysis 

20 Primary pleomorphic adenomas of the salivary glands 

were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were made and analyzed 
as described by Roijer et al. (1996) . Primary pleomorphic 
adenomas, including CG368, CG580, CG644, CG752, CG753 and 

25 T9587, which all carry the recurrent t (3 ;8) (p21;ql2) as the 
sole anomaly, were selected for molecular analysis, as well 
as CG682, showing an ins (8 ; 3) (ql2 ;p21 . 3pl4 . 1) , and CG588 
which carries a t (8 ; 15) (ql2 ;ql4 ) . CG tumors are obtained 
from Department of Pathology, Goteborg University, 

30 Sahlgrenska University Hospital, S-41345 Goteborg, Sweden. 
The tumor identified as T9587 is obtained from University 
Hospital Ghent, Belgium. FISH analysis was performed as 
previously described (Roijer et al. (1996)). Slides were 
examined in a Zeiss Axiophot epif luorescence microscope 

35 using the appropriate filter combinations. Fluorescence 
signals were digitalized, enhanced and analyzed using the 
ProbeMaster FISH image analysis system (Perceptive 
Scientific Instruments, Houston, Texas) . Color prints were 
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produced using a Kodak XL 7700 monochrome continuous 
printer . 

2.2 Preparation and analysis of DNA and RNA 

5 Extraction of genomic DNA and Southern blot analysis 

were performed as described previously ( Schoenmakers et al. 
(1994)). Total RNA was extracted from biopsy samples using 
the TRIZOL (Gibco/BRL) method and Northern analysis was 
performed according to standard procedures (Sambrook et al. 

10 (1989)). DNA from YACs, cosmids, PCR products, and 

oligonucleotides was labelled using a variety of techniques. 
For FISH, cosmid clones or inter-Alu PCR products of YACs 
were biotinylated with biotin-ll-dUTP (Boehringer) by nick 
translation. For filter hybridizations, probes were radio- 

15 labelled with a- 32 P-dCTP using random hexamers (Feinberg & 
Vogelstein (1984)). In case of PCR-products of T-genes of 
the invention smaller than 200 bp in size, a similar 
protocol was applied, but T-gene specific oligonucleotides 
were used to prime labelling reactions. Oligonucleotides 

20 were labelled using y- 32 P-ATP. 

2.3 YAC, cosmid, phage, and cDNA libraries 

YAC clones in this paper were isolated from the CEPH 
mark 1 YAC library (Albertsen et al. (1990)), using a 

25 combination of PCR-based screening and colony hybridization 
analysis (Green & Olson (1990)). YAC DNA was isolated and 
characterized as described before (Schoenmakers et 
al.(1995); Schoenmakers et al. (1994)). Cosmid clones were 
isolated from an arrayed human chromosome 8 -specific cosmid 

3 0 library (Wood et al. (1992)) obtained from Los Alamos 

National Laboratory (LANL) . LANL-derived cosmid clones are 
indicated by their unique microtiter plate addresses. Phage 
clones were derived from a genomic library constructed with 
Li-l4/SV40 DNA (Schoenmakers et al. (1994)) in AFIXII 

35 according to standard procedures. Cosmid and phage DNA was 
extracted using standard techniques involving purification 
over Qiagen tips (Qiagen) . Positive cDNA clones were 
identified in a mixed poly-dT/random-primed library in Xgtll 
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constructed from human fetal kidney (Clontech) . Library 
screening was performed by plaque hybridization using 
various DNA probes, derived from subsequently isolated cDNA 
clones, according to the manufacturer's instructions. 

5 

2.4 DNA sequencing and computer analysis 

Nucleotide sequences were determined according to the 
dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/ LKB or the dsDNA Cycle Sequen- 
10 cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
standard 30 cm, 6% Hydrolink*, Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI) . 

15 

2.5 PCR amplification of genomic DNA 

PCR amplifications were carried out essentially as 
described before ( Schoenmakers (1994)). The following 
amplimers were used to generate a PLAG1 exon 1 probe (5'-CAA 

20 TGG CTG CTG GAA AGA GG-3 » and 5'-CCC GTC CGC CGC CTC TAC 
ACC-3') , a PLAG1 ORF probe (5*-CGT AAG CGT GGT GAA ACC AAA 
C-3 ' and AGG GTC GTG TGT ATG GAG GTG A- 3 ' ) , a PLAG1 3 1 -UTR 
probe (5'-ACA TGG CAT TTC GTG TCA CT-3 » and 5»-CCA CAA TGG 
CTC TAG AT-3 1 ) and a CTNNB1 exon 1 probe (5 '-TGT GGC AGC AGC 

25 GTT GGC CCG GC-3 « and 5 ' -CTC AGG GGA ACA GGC TCC TC-3 • ) . 

2.6 Rapid amplification of cDNA ends (RACE) 

Rapid amplification of 3' cDNA-ends (3' -RACE) was 
performed using a slight modification of part of the 

30 GIBCO/BRL 3 1 -ET protocol. For first strand cDNA synthesis, 
adapter primer (AP2) AAG GAT CCG TCG ACA TC(T)17 was used. 
For both initial and secondary rounds of PCR, the universal 
amplification primer (UAP2) CUA CUA CUA CUA AAG GAT CCG TCG 
ACA TC was used as "reversed primer" . In the first PCR round 

35 the following specific "forward primers" were used: i) 5 1 - 
CAA TGG CTG CTG GAA AGA GG-3' (exon 1) or ii) 5»-AGA ATT TGG 
GCC TCA GAC AAG ATA- 3 ' (3'-UTR, exon 5). In the second PCR 
round the following specific forward primers (nested primers 
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as compared to those used in the first round) were used: i) 
5'-CAU CAU CAD CAU GGC CGG AGG GAG GAT GTT AA-3 ■ (exon 1) or 
ii) 5' -CAU CAU CAU CAU ATT GTC CTG GGT TGA TTA TGC AT- 3 ' 
(3»-UTR, exon 5). CUA/CAU-tailing of the nested, specific 
5 primers allowed the use of the directional CloneAmp cloning 
system (GIBCO/BRL) . 

For 5 '-RACE experiments, the Marathon cDNA Amplificati- 
on kit (Clontech) was used according the manufacturer's 
instructions with minor modifications. The 5 ' -untranslated 

10 end of the normal PIAG1 transcript as well as the chimeric 
transcripts were isolated by 5' -RACE. First strand placenta 
or adenoma cDNA respectively, was synthesized from 5 fxg 
total RNA using the MV2 primer (5' -CTG CAC TTG ACC CAC CCC 
TTG GAT-3 ' ) located in exon 5. The ds cDNA was ligated to 

15 the adaptor and amplified using the anchor primer API and 
the MV5 primer (5«-CAG GAG AAT GAG TAG CCA TGT GC-3 • ) also 
located in exon 5. A second round of PCR was performed using 
the anchor primer and the MVS primer (5'-TGC ACT TGT AGG GCC 
TCT CTC CTG-3') located in exon 4. The final PCR products 

2 0 were purified out of agarose gel and cloned into the pCRIl 
vector (Invitrogen) . 

2 . 7 RT-PCR 

Total RNA (5 fig) was reverse-transcribed using Super- 
25 script II reverse transcriptase (GIBCO BRL) and oligo d(T) 
primers according to the recommended conditions. 0.25 fig of 
the resulting cDNA was subject to amplification using a 
variety of primer sets. The amplification conditions for the 
CTNNB1 /PLAG1 fusion transcripts were 3 0 cycles at 9 4 °C for 
30 10 sec and 68 °C for 1 min in a final volume of 50 fxl using 
the Expand long template PCR system (Boehringer Mannheim) . 
The first round PCR was carried out with the CTNNB1 primer 
5«-TGT GGC AGC AGC GTT GGC CCG-3 * (CAT-UP) and the PLAG1 
primer 5«-CAG GAG AAT GAG TAG CCA TGT GC-3 » (MV5) . The 
35 second round was performed on a 20 fold diluted sample with 
the CTNNB1 primer 5'-ACG GAG GAA GGT CTG AGG AGC AG- 3 • 
(NECAT-UP) and the PLAG1 primer 5' -TGC ACT TGT AGG GCC TCT 
CTC CTG-3' (MV6) . To amplify the reciprocal PLAGI/ctnnbi 
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fusion transcript two rounds of PCR amplification were 
performed with 30 cycles at 94 °C for 3 0 sec, 63 °C for 3 0 
sec and 72 °C for 1 min in a final volume of 50 fil. The 
first round was carried out with the PIAG1 primer 5 ' -CAA TGG 
5 CTG CTG GAA AGA GG-3 ' (START-UP) and the CTNNB1 primer 5'- 
AAG GAG CTG TGG TAG TGG CAC-3 1 (CAT3) . The second round was 
performed on a 20 fold diluted sample with the PLAG1 primer 
5'-GGC CGG AGG GAG GAT GTT AA-3 ' ( START-RACE ) and the CTNNB1 
primer 5'-GCC GCT TTT CTG TCT GGT TCC A-3 1 (CAT3NEST) . 

10 

3 . Results 

3 . 1 The chromosome 8ql2 breakpoint cluster region in pleo- 
morphic adenoma 

As part of a positional cloning effort to isolate the 

15 gene(s) affected by the t(3;8) (p21;ql2) in pleomorphic 

adenomas, we constructed a yeast artificial chromosome (YAC) 
contig consisting of 34 overlapping YACs, spanning about 2 
Mb within band 8ql2 (to be published elsewhere) . A long 
range STS and rare cutter physical map of this region was 

20 also developed. In previous experiments the t(3;8) break- 
point was mapped to a 1 Mb region flanked by MPS as proximal 
and by D8S166 as distal marker. One YAC within this region 
was shown to span the t(3;8) breakpoint in two tumors (CG588 
and CG644) . To further narrow down the breakpoint region we 

25 isolated cosmids corresponding to landmarks within this and 
other YACs in our contig (Fig. 3 A) . By FISH, we could 
demonstrate that the breakpoints in four out of four adeno- 
mas tested (CG580, CG588, CG644 and CG682) clustered within 
a 300 kb subregion located between MPS and STS CH129 (Fig. 

3 0 3A) . Two cosmids from this region, CEM23 and CEM48, were 
shown to span the breakpoints in adenomas CG644 (Fig. 3B) 
and CG682 (data not shown) , respectively. To refine the 
distribution of breakpoints within this 3 00 kb subregion, we 
developed a contig consisting of 29 phage and cosmid clones 

35 (Fig. 3A) . 



3.2 Identification and characterisation of the PLAG1 gene 
in the 8ql2 breakpoint region 
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While initiating experiments to identify candidate 
genes mapping within the 300 kb genomic segment harbouring 
the adenoma breakpoints, BLAST searches (Altschul et al. 
(1990)) of newly obtained STSs mapping within the 300 kb 
5 contig revealed that the right end (CH283) of YAC clone 

143D5 displayed sequence identity with a publicly available 
EST (expressed sequence tag) (GenBank Accession number 
D59273). its position in the contig raised the possibility 
that this EST belonged to the candidate pleomorphic adenoma 

10 gene (PLAG1) , we were searching for. 

In initial Northern blot experiments using a PCR probe 
corresponding to this particular YAC end, a 7.5 kb 
transcript was detected. A combination of sequential scree- 
nings of a human fetal kidney cDNA library as well as 5 ' - 

15 and 3 1 -RACE experiments led to the isolation of a composite 
cDNA of 7313 nucleotides (GenBank accession number U65002) . 
This cDNA contains an open reading frame (ORF) of 1500 bp 
starting with the ATG at position 481-483 (Fig. 4A) . An in 
frame stop codon (TAG) is present 9 nucleotides upstream of 

20 this ATG. The deduced amino acid sequence reveals seven 
canonical C2H2 zinc finger domains (Fig. 4B) and a non- 
finger region of 259 amino acid residues representing the 
carboxy-terminus of the deduced protein. The zinc finger 
motifs including the linker sequences are between 2 8 and 35 

25 amino acids long. The cysteine (C) and histidine (H) residu- 
es are present in their characteristic positions in each 
finger. The typical phenylalanine (F) and leucine (L) resi- 
dues in finger 4 and 6 are lacking. Strictly spoken, the 
deduced protein is not a Kruppel zinc finger protein, since 

30 it does not contain the characteristic H/C linker (consensus 
sequence TGEKPYK) in between the zinc fingers (Bellefroid et 
al. (1989)). Only the seven amino acids between finger 1 and 
2 resemble the H/C linker (TGERPYK) . The amino-terminal 
region contains two nuclear localization signals (KRKR and 

35 KPRK) . The carboxy-terminus is serine-rich (45 amino acid 
residues out of 259, i.e 17%), raising the possibility of a 
regulatory role that may be controlled by serine/threonine 
kinases . 
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Outside the ORF, some interesting features in the cDNA 
can be observed. In the 3' untranslated region (UTR) , which 
is 53 3 3 bp long, a polyadenylation signal is present star- 
ting at position 7297, and there is also a TG-repeat, which 
5 might be of regulatory relevance. In addition, there is an 
ATTTA sequence motif, which has previously been shown to 
mediate mRNA destabilization in certain lymphokine and 
immediate early genes (Sachs (1993)). The ATTTA motif is 
repeated twelve times in the 3' -UTR. 

10 No sequence similarity was found to any of the known 

zinc finger proteins but preliminary studies have indicated 
that the human genome contains at least one additional gene 
that is closely related to our candidate pleomorphic adenoma 
gene. Nine anonymous ESTs (expressed sequence tags) with 

15 sequence similarity to sequences of our new gene were found 
in public databases and these ESTs could be aligned resul- 
ting in a contiguous sequence of about 1 kb. An open reading 
frame appeared to be present with coding potential for 
protein sequences with zinc-finger- like features. Sequence 

20 identity between the aligned ESTs and our new gene appeared 
to be about 75% at the amino acid sequence level; i.e. in 
the region of our gene encoding zinc fingers 4 to 7 . Of 
interest to note furthermore is that these anonymous ESTs 
sequences have been mapped to chromosome 6q24, which is 

25 implicated in tumors of the salivary glands. 

3 . 3 Genomic organization of the PLAG1 gene 

Comparison of transcribed and genomic DNA sequences of 
the PLAG1 gene revealed that it contains 5 exons (Fig. 3 A) . 

3 0 The transcriptional orientation of the gene is directed 

towards the centromere. The first three exons are noncoding, 
the fourth exon contains the translation start site (ATG) , 
and the amino-terminus of the protein including one complete 
zinc finger domain. The second finger is split by intron 4 

3 5 and continues into exon 5, which contains the remaining part 
of the ORF and the long 3»-UTR (553 3 bp). Based on our YAC 
and cosmid maps, the PLAGl locus spans about 3 5 kb, with a 
large intron (approximately 25 kb) between exon 1 and exon 
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2. 3' -and 5 1 -RACE analysis revealed the existence of two 
alternatively spliced irtRNAs which differ from each other by 
the presence or absence of the noncoding exon 2. The diffe- 
rence in electrophoretic mobility of the two mRNA isoforms 
5 is too small to distinguish them by Northern blot analysis. 
The functional relevance of this alternative splicing re- 
mains to be elucidated. 

3.4 Rearrangements of PLAG1 in primary pleomorphic adenoma 
10 In an attempt to establish whether the 8ql2 aberrations 

in primary pleomorphic adenomas with t(3 ;8) (p21;ql2) led to 
rearrangements in PLAG1 . Southern blot experiments were 
performed using various PLAGl -derived probes. With a probe 
within the ORF (STS probe EM265) or the 3«-UTR (KK64) of 

15 PLAGl . no rearrangements were detected. However, using 
probes corresponding to the 5 • noncoding region of PLAG1 . 
i.e. intron 1 (STS probes EM416 or EM317) or exon 2 {STS 
probe EM440) , rearrangements were detected in each of the 
t(3;8) tumors tested (data not shown). Similar Southern blot 

20 analysis of primary pleomorphic adenoma CG580, which carries 
a t(8;15) (ql2;ql4) , revealed also a rearrangement in the 5 1 
noncoding region of PLAGl (Fig. 5) . These results seem to 
directly implicate PLAGl as a critical gene in pleomorphic 
adenoma tumorigenesis . Furthermore, the mechanism might 

25 involve transcriptional deregulation of PLAGl, since the 
rearrangements in PLAGl were invariably found in the 5 ' 
noncoding region of PLAGl leaving its coding region intact. 

3.5 t(3;8) results in promoter swapping between PLAGl and 
30 the gene for 6-catenin, CTNNB1 

The observation that the 8ql2 translocation breakpoints 
in various primary tumors were mapped to intron 1 or intron 
2 of the 5 '-noncoding region of PLAGl raised the possibility 
that the t(3;8) results in the production of a chimeric 
35 transcript consisting of PLAGl sequences fused to those of a 
gene located on chromosome 3p21. To test this possibility, 
5 ' -RACE experiments were performed using total RNA of the 
primary tumors CG644 and CG682, which both carry a 3; 8- 
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rearrangement. We used a primer specific for exon 5 (MV2) in 
the cDKA synthesis. Following ligation of an adaptor to the 
cDNA, PCR amplification was performed using, in the first 
round, an adaptor-specific primer and a PLAGl -specif ic 
5 primer corresponding to sequences of exon 5 (MV5) and, in a 
second round, nested adaptor-specific primer and PLAGl- 
specif ic primer corresponding to sequences of exon 4 (MV6) . 
Nucleotide sequence analysis of the PCR product revealed 
that the ectopic sequences were fused to the acceptor splice 

10 site of exon 3 of PLAG1 . BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1 (Nollet et al. 
(1996)), the gene for 6-catenin, which has previously been 
assigned to chromosome 3p21 (Kraus et al. (1994)). 

Independent evidence for the involvement of CTNNB1 in 

15 the t( 3; 8) -rearrangements in pleomorphic adenomas was obtai- 
ned by FISH using two YACs (756G5 and 750D3) , which contain 
the complete gene for 6-catenin. According to published maps 
YAC 756G5 is contained within the proximal part of mega-YAC 
750D3 (Gemmill et al. (1995)). As expected, both YACs 

20 spanned the 3p21 breakpoint in adenoma CG682, with 

hybridization signals on the normal 3, the der(3) and the 
der(8) (data not shown). Detailed analysis of a few 
prometaphase cells enabled us to sublocalize the 6-catenin 
locus to band 3p21.3, which is in accordance with previous 

25 mapping data. 

To extend our observation to a larger number of tumors, 
we applied a reverse transcription-polymerase chain reaction 
(RT-PCR) approach. RT-PCR amplifications using primers 
specific for CTNNB1 and PLAG1 were carried out with RNA from 

30 tumors CG368, CG588, CG644 , CG752, CG753 and T9587 , which 
all carry the recurrent t(3;8), and from tumor CG682, which 
carries an ins(8;3) (ql2 ?p21. 3pl4 . 1) . RNA from tumor CG580, 
which carries a t(8;15), was included as a negative control. 
PCR experiments resulted in the generation of PCR products 

35 corresponding to hybrid transcripts consisting of PLAG1 and 
CTNNB1 sequences, in seven out of seven t(3;8) tumors analy- 
zed (Fig. 6). In tumors CG368, CG588, CG682, CG752 , and 
T9587, PCR products of 509 bp (Fig. 6A, PCR product A) and 
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614 bp (Fig. 6A, PCR product B) were generated, whereas in 
tumors CG644 and CG753, only the PCR product of 509 bp was 
found. The PCR product of 509 bp (from NECAT-UP up to MV6) 
is corresponds to a hybrid transcript containing exon 1 of 
5 CTNNB1 and exons 3 to 5 of PLAGl . The PCR product of 605 bp 
contains an extra 105 bp, which corresponds to the alterna- 
tively spliced exon 2 of PLAG1 . It points towards the pre- 
sence of a related isoform consisting of exon 1 of CTNNBl 
and exons 2 to 5 of PLAGl . This was also confirmed by nucle- 

10 otide sequence analysis of the PCR products. We were also 
able to demonstrate that the corresponding reciprocal fusion 
transcripts are expressed. In all tumors except CG682, a PCR 
product of 13 0 bp was generated corresponding to a fusion 
transcript consisting of exon 1 of PLAGl and exons 2 to 16 

15 of CTNNB1. In tumor CG753, an additional PCR product was 

detected, corresponding to a fusion transcript consisting of 
exons 1 to 2 of PLAGl and exons 2 to 16 of CTNNB1 . This 
additional band was also observed but with weak intensity in 
tumor CG644. All these results indicate that in CG644 and 

20 CG753, the 8ql2 translocation breakpoints are located in 
intron 2, whereas the breakpoints of tumors CG368, CG588, 
CG682, CG752, and T9587 are located in intron 1. Interestin- 
gly, using 5 f RACE analysis with the tumor CG580, which 
carries a t(8;15) translocation, we found that the break- 

25 point occurs also in the same region leading to a fusion 
transcript with ectopic fused to exon 3 of PIAGl . 

3 . 6 Activation of PLAGl expression due to promoter swapping 
Northern blot analysis was performed to evaluate the 

30 effect of the t (3 ;8) (p21;gl2) translocation on the expressi- 
on levels of PLAGl and CTNNBl (Fig. 7). As previously menti- 
oned, PLAGl is expressed as a 7.5 kb transcript, which was 
readily detected in human fetal lung, liver, and kidney but 
not in fetal brain. In adult tissues, the 7 . 5 kb transcript 

35 was not detected in human heart, brain, lung, liver, skele- 
tal muscle, kidney, pancreas, and salivary gland. Low levels 
of PLAGl expression was found in human placenta (Fig. 7A) . 
In contrast, the CTNNBl gene was ubiquitously expressed as a 
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3.8 kb RNA doublet in all tissues tested (Fig. 7B) . Similar 
results were obtained in the Northern evaluation of fetal 
and adult mouse tissues (data not shown) . 

Northern blot analysis was performed on two pleomorphic 
5 adenomas with t(3;8) and on one tumor with a variant t(8;15) 
(Fig. 7C) . In contrast to normal salivary gland tissue in 
which no PLAG1 transcripts could be detected, the three 
tumors expressed a 7.5 kb transcript. This transcript does 
not correspond to the normal PLAG1 transcript (exons 1 to 

10 5) , since an exon 1-specif ic probe failed to hybridize to 
this transcript (data not shown) . Instead, a probe specific 
for exon 1 of CTNNB1 recognized this 7.5 kb transcript as 
well as the 3.8 kb doublet of CTNNB1 transcripts in the two 
tumors with a t(3;8). In contrast, only the 3.8 kb doublet 

15 of CTNNBl transcripts was detected in adenoma CG580 with a 
variant t(8;15) . The expression level of CTNNBl in this 
tumor was about twofold higher as compared to tumors CG644 
and CG682, which is expected since none of the CTNNBl alle- 
les is rearranged. Collectively, these results suggest that 

20 PLAGl expression in these pleomorphic adenomas is driven by 
the 5 1 regulatory sequences either of the CTNNBl gene for 
the cases with t(3;8) or of an as yet unknown gene on chro- 
mosome 15 for the case with t(8;15). These results indicate 
that the principal mechanism in pleomorphic adenomas with 

25 8ql2 rearrangements is activation of PLAGl expression, due 
to promoter swapping between PLAGl and a translocation 
partner gene, preferentially CTNNBl. Furthermore, expression 
of the translocation partner gene is likely to be down- 
regulated concomitantly; e.g. as demonstrated for CTNNBl . 

30 

4_i. Discussion 

We have characterized a novel gene, PLAGl . which we 
propose is a critical locus involved in the development of 
pleomorphic adenoma of the salivary glands. The gene was 
35 identified on the long arm of chromosome 8, close to the MPS 
proto-oncogene, as a result of a positional cloning project 
to molecularly define the t(3 ;8) (p21;ql2) , which is the most 
frequent chromosome aberration in pleomorphic adenomas. We 
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have also identified the translocation partner gene at 3p21, 
i.e. the CTNNB1 gene which encodes B-catenin. In addition to 
8ql2 and 3p21, chromosome rearrangements involving 12ql3-i5 
are also frequently observed in pleomorphic adenomas. The 
5 high mobility group protein gene HMGIC was recently identi- 
fied as the gene consistently rearranged in these cases 
(Schoenmakers et al. (1995)). In conclusion, we have 
molecular ly defined the three main chromosomal 
rearrangements implicated in the genesis of pleomorphic 
10 adenomas. 

Structural rearrangement of the 8qll-13 region are 
consistently found also in benign lipoblastomas. These 
tumors are considered to be unique variants of lipoma, 
occurring primarily in children before 3 years of age. Among 

15 the seven cases so far analyzed cytogenetically, rearrangem- 
ents involving 8qll-13 were observed in six cases (Mitelman 
(1994)). similar to pleomorphic adenomas, the translocation 
partners of 8ql2 are variable. The similarities in 
chromosomal pattern between these two tumor types suggest 

20 that PLAG1 is the target of the 8qll-13 rearrangements also 
in lipoblastomas. 

We have also elucidated the molecular consequences of 
the t(3;8) translocation. The 8ql2 translocation breakpoints 
were invariably found in the 5 ' -non-coding region of PIAG1 

25 in all seven primary tumors tested. The breakpoints in the 
CTNNB1 gene were always found in the first intron. As a 
result of the translocation, the coding sequences of PLAG1 
are brought under control of the 5' regulatory sequences of 
tne CTHNBl gene, and vice versa, resulting in the activation 

3 0 of PLAG1 and down-regulation of CTNNB1. Based on their 

expression patterns in normal fetal and adult tissues, it is 
clear that the regulation of expression of the two genes is 
very different. The CTNNB1 gene is highly and ubiquitously 
expressed, whereas PIAG1 is developmentally regulated, with 

35 expression restricted to fetal tissues. It should be empha- 
sized that in adult tissues, PLAG1 is either not expressed 
or expressed at very low levels. In the adenomas, both the 
CTNNByPTAGi and the reciprocal PLAG l / CTNNB 1 transcripts 
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were detected by RT— PCR; detection of the latter transcripts 
sometimes required three rounds of PCR, indicating that the 
expression levels are very low. Our findings represent the 
first example of reciprocal exchange of expression control 
5 elements in solid tumors. Since the coding sequences of both 
genes are invariably preserved, the molecular mechanism 
could be classified as promoter swapping, resulting in 
activation of PLAG1 and down-regulation of CTNNB1 
expression. 

10 Since the t(3;8) in pleomorphic adenomas invariably 

leads to activation of PLAGl, it is of interest to evaluate 
possible physiological implications. The PLAGl gene encodes 
a protein with a deduced molecular weight of 56 kDa and a 
deduced pi of 8.56. Analysis of the open reading frame of 

15 PLAGl reveals seven zinc fingers in the amino-terminal 

region. The carboxy-terminal region is rich in serine resi- 
dues. Furthermore, two potential nuclear localization sig- 
nals are present (residues 22-25 and 29-32) . Collectively, 
this suggests that the PLAGl protein is a novel member of 

20 the large zinc finger gene family. Zinc finger motifs were 
originally identified as DNA binding structures in the RNA 
polymerase III transcription factor TFIIIA, which binds to 
the internal control region of the 5S RNA gene. TFIIIA-like 
zinc fingers are present in a variety of regulatory proteins 

25 found in higher and lower eukaryotes. This type of zinc 

finger motif consists of «30 amino acids with two cysteine 
and two histidine residues (C2H2) that stabilise the domain 
by tetrahedrally coordinating a Zn 2+ ion. A region of «12 
amino acids between the invariant cysteine-histidine pairs 

30 is characterized by scattered basic residues and several 
conserved hydrophobic residues. Most zinc finger genes 
studied so far encode polymerase II transcription factors. 
Apart from transcriptional modulation and control of RNA 
metabolism, chromatin packaging might also constitute an 

35 important activity through which zinc finger proteins exert 
their regulatory roles (El-Baradi & Pieler (1991)). It 
should also be noted that the presumed role of HMGIC is also 
in chromatin modelling (Schoenmakers et al. (1995) ; Ashar et 
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al. (1995); Wolffe (1994)). The mammalian genome is known to 
contain a large number of C2H2 zinc finger genes (Bellefroid 
et al. (1989)) and the number of such genes implicated in 
cancer is growing steadily. 
5 The function of the serine rich carboxy-terminal part 

of PIAG1 is unknown but it may have a regulatory function 
that can be controlled by serine/threonine kinases. If PLAG1 
encodes a DNA binding protein, as the presence of its zinc 
fingers suggests, the carboxy-terminal region might 

10 represent the transactivation domain. At least four 

different primary sequence motifs that characterize the 
activation domains are identified thus far, i.e. acidic, 
glutamine-rich, proline-rich and serine/threonine-rich. They 
likely represent regions that functionally interact with 

15 other proteins (Mitchell & Tjian (1989)). 

Activation of PLAG1 due to promoter substitution may 
lead to deregulation of genes normally controlled by PLAGl . 
If overexpression of PLAGl is indeed a major molecular 
consequence of the 8ql2 translocations than this could 

2 0 possibly be achieved also by other mechanisms, including for 
instance an increase in gene copy number. In this context it 
should be noted that a small subgroup of pleomorphic adeno- 
mas show trisomy 8, often as the sole anomaly (Mitelman 
(1994) ) . In addition, trisomy 8 is a common numerical 

25 abnormality found in several histological subtypes of 
malignant salivary gland tumors as well as in different 
subtypes of leukemias and sarcomas (Mitelman (1994)). 

The preferential fusion partner of PLAGl is the CTNNB1 
gene, which encodes B-catenin, a cytoplasmic protein of 

30 about 88 kDa (Gumbiner & McCrea (1996)). Its frequent 
involvement in pleomorphic adenomas as found here might 
point towards a critical role of CTNNB1. Most likely, a role 
of CTNNB1 is to provide an active promoter in front of the 
PLAGl gene. However, since the observed promoter swapping 

35 between PLAG and CTNNB1 leads to down-regulation of CTNNB1 
expression, it may also lead to other physiological 
consequences, especially since B-catenin is a broad-range 
protein interface, as that has been implicated in highly 
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diverse processes, such as human colon cancer, epithelial 
cell adhesion, embryonal axis formation in Xenopus , and 
pattern formation in Drosophila (Peifer (1993)). The B- 
catenin protein has also been found as structural component 
5 of adherens junctions (AJs) (Peifer (1993); Kemler (1993); 
Gumbiner (1996) ) , which are multiprotein complexes assembled 
around Ca 2+ -regulated cell adhesion molecules (cadherins) . 
B-Catenin binds directly to cadherins and acts as a protein 
interface between cadherin and the cytoskeleton. The 

10 cadherin-6-catenin complex mediates cell adhesion, 

cytoskeletal anchoring, and signalling, which are important 
processes for regulation of cell growth and behaviour. It 
has been suggested that AJ complexes may play a role in the 
transmission of signals for contact inhibition (Peifer 

15 (1993)). Down-regulation of 6-catenin, as occurs in 

pleomorphic adenomas, could interfere with this transmission 
and lead to growth deregulation. Since B-catenin levels 
apparently are important physiologically, it might be that 
down-regulation of CTNNB1 expression plays some role in 

20 pleomorphic adenoma development, possibly synergistically to 
PLAGl. In conclusion, as activation of PLAG1 is observed in 
all pleomorphic adenomas with 8ql2 involvement tested, this 
seems to constitute a major factor in salivary gland tumor 
formation. The critical role of down-regulated levels of B- 

25 catenin remains to be established. 

Previous studies of HMGIC have shown that rearrange- 
ments of this gene preferentially occur in particular cyto- 
genetic subgroups of benign mesenchymal tumors, with the 
only exception so far being a subgroup of pleomorphic adeno- 

30 mas (Schoenmakers et al. (1995)). When considering the 
findings in the latter type of tumors, it should be noted 
that, although they are epithelial in origin, they often 
also show a varying degree of mesenchymal differentiation. 
This may be explained by the fact that pleomorphic adenomas 

35 are thought to originate from a pluripotent reserve cell in 
the terminal duct system which possesses the capacity to 
differentiate into both epithelial and myoepithelial cells 
(Evans & Cruickshank (1970)). The latter cells may function 
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as facultative mesenchymal cells, and are responsible for 
the production of extracellular material, including myxo- 
chondroid stroma (Dardick et al. (1991)). An intriguing 
question in this context is of course whether HMGIC is 
5 preferentially affected in adenomas with a prominent stromal 
component while PLAG1 is preferentially affected in tumors 
with little or no stroma. Histological re-examination of all 
but one of the adenomas with PLAG1 involvement in this 
study, revealed that all cases contained mesenchymal 

10 components, including both myxoid and chondroid tissues, 
suggesting that mesenchymal differentiation is not 
restricted to adenomas with HMGIC involvement. It should be 
noted that previous cytogenetic observations have already 
indicated that chromosome 12ql3-15 and 8ql2 rearrangements 

15 constitute mutually exclusive pathways for the evolution of 
pleomorphic adenomas (Sandros et al. (1990)). 

The identification and cloning of a novel "benign 
oncogene" activated by chromosome translocations constitute 
a major step towards an increased understanding of the 

20 molecular pathogenesis of benign neoplastic growth. Our 

findings may lead to new insights into the genetic differen- 
ces between benign and malignant tumors. In addition, the 
present findings may have diagnostic implications. The fact 
that PLAG1 is not normally expressed in adult tissues, but 

25 activated in pleomorphic adenomas due to 8ql2 rearrange- 
ments, makes it a potentially useful tumor marker in the 
differential diagnosis of benign and malignant salivary 
gland tumors. On the basis of the fusion transcripts that 
are formed, highly specific diagnostic assays can be develo- 

30 ped, as will be illustrated in the following examples. 

FIGURE LEGENDS 
Figure 3 

A: Contig of 3 overlapping YACs (bold lines) , 27 cos- 
35 mids and 2 phages, containing 27 landmarks and spanning a 
300 kb DNA region on chromosome 8ql2. Contig elements are 
labelled or numbered and defined in the list below. Cosmid 
clones isolated from the arrayed chromosome 8-specific 
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cosmid library constructed at Los Alamos National Laboratory 
(LANL) (Wood et al. (1992)) are named after their unique 
microtiter plate addresses. #1 and #22 are genomic phage 
clones; #21 is a clone isolated from the non-arrayed LANL 
5 chromosome 8-specific cosmid library. The orientation of the 
contig on the long arm of chromosome 8 is given as well as 
the order of 27 landmarks. It should be noted that the 
contig is not scaled. Below the contig, the genomic 
organization and the relative location of the PLAG1 gene is 

10 given schematically, with exact sizes (bp) of its exons and 
estimated sizes (kb) of its introns. Noncoding sequences are 
represented as open boxes and coding sequences as black 
boxes. The relative positions of the translation initiation 
(ATG) and stop (TAG) codons in PLAG1 are indicated. At the 

15 bottom of the figure, characteristics of the deduced protein 
encoded by PLAG1 are given. Zinc fingers are labelled F1-F7. 



1 = PEM18 - CH33 =D8SXXX 

2 = 96E9- PENK =D8SXXX 
20 3 = 41E12 - PENK =D8SXXX 

4 = 17B6 *- EMI 72 = D8SXXX 

5 = 93B2 - EMI 72 =D8SXXX 

6 = 47D11 -CH280 =D8SXXX 

7 = S3B10 - CH280 =D8SXXX 
25 8 = 66E5-CH280 =D8SXXX 

9 = 144A7 - CH280 =D8SXXX 

10 = 163DU -CH280 =D8SXXX 



11 = 163E1 -CH280 =D8SXXX 

12 = 203B8 - CH280 =D8SXXX 

13 = S0B7-EM187 =D8SXXX 

14 = 115H9 - EM187 = D8SXXX 

15 = 129BI0-CH129 =D8SXXX 

16 = 116A2 -CH129 =D8SXXX 

17 = 7B7 - CHI 29 = D8SXXX 

18 = 4H4 - EM156 =D8SXXX 

19 = 4H5 - EM156 = D8SXXX 

20 = 4H6 - EM156 = D8SXXX 



21 = CEM55 <- EM195 =D8SXXX 

22 = PEM49- EM3I7 =D8SXXX 

23 =149G12 -D8S285 

24 = 64C1 -D8S285 

25 = 64D1 - D8S285 

26 = 143B1 - MOS =D8SXXX 

27 = 143H1 - MOS =D8SXXX 

28 = 148E9- MOS =D8SXXX 

29 = 22B8 - MOS =D8SXXX 



(clone number, cosmid or phage clone, STS, accession number) 

30 

B: Mapping of the 8ql2 translocation breakpoint in an 
adenoma (CG644) with a t (3 ;8) (p21;ql2) . Cosmid CEM48 (white 
spots) was co-hybridized with an alpha-satellite probes one 
specific for chromosome 3 and the other for chromosome 8 
35 (black spots) . Hybridization signals were found on the 
normal 8, the der (3), and the der (8), indicating that 
CEM48 spans the t ( 3 ; 8 ) (p21 ;ql2 ) breakpoint. Chromosomes are 
stained in blue with DAPI. 

40 Figure 4 

A: cDNA and deduced amino acid sequence of PLAG1 . The 
relative positions of exon/intron boundaries are indicated 
by triangles (t) . The conserved C, F, L and H residues in 
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the zinc finger domains are underlined. Residues 22-25 and 
29-32 constitute two potential nuclear localization signals. 
A potential polyadenylation signal and the putative mRNA 
destabilizing ATTTA motifs in the 3'-noncoding region are 
5 underlined. The nucleotide sequence of the complete cDNA has 
been deposited at GenBank under accession number XXX. 

B: Alignment of the seven zinc-finger-like motifs found 
in the deduced amino acid sequence of PIAG1 relative to the 
C2H2 consensus motif. The canonical C, F, L and H residues 
10 are given in bold. 

Figure 5 

Illustrative example of the detection of DNA rearrange- 
ment in PLAG1 of a primary pleomorphic adenoma using Sout- 
15 hern blot analysis. DNA of pleomorphic adenoma CG368 (panel 

I) and control DNA isolated from normal lymphocytes (panel 

II) were digested with restriction endonuclease BamHI (B) , 
EcoRI (E) , Hindlll (H) , or PstI (P) , as indicated. A probe 
with specificity for exon 3 of PLAG1 (EM440) was used. The 

20 molecular weight markers are 23.1, 9.4, 6.5, 4.3, 2.3 and 
2.0 kb, respectively. 

Figure 6 

A: Detection of CTNNB1 /PLAG1 and PLAG 1 / CTNNB 1 fusion 
25 transcripts by RT-PCR in primary adenomas. I. CTNNB 1/ PLAG 1 
were detected using the RT-PCR protocol and primers descri- 
bed in detail in Methods. Primary tumors analyzed included 
CG368 (lane 1), CG588 (lane 2), CG644 (lane 3), CG682 (lane 
4), CG752 (lane 5), CG753 (lane 6), T9587 (lane 7), and 
30 CG580 (lane 8). II. PLAG1/ CTNNB 1 fusion transcripts were 
detected similarly using the same samples as under "I*. 
Details of the primers used here are given in Methods Secti- 
on. PCR products are labelled A-D. 

B: Schematic representation of the nature and origin of 
35 CTNNB 1/PLAG1 and PLAG1 / CTNNB 1 fusion transcripts in primary 
pleomorphic adenomas with t(3 ;8) (p21;gl2) . At the top of the 
figure, the exon/intron distribution of the PLAG1 gene is 
given, at the bottom, the exon/intron distribution for the 
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CTNNB1 gene. Positions of chromosome breakpoints are indica- 
ted by an arrow (N ) . Translation initiation sites are indi- 
cated by asterisks (*) and stop codons by triangles (t) . A- 
D: schematic exon compositions of hybrid transcripts, as 
5 established by 5* -RACE analysis. A: cDNA sequence junction 
between exon 1 of CTNNB1 and exons 3-5 of PLAG1. B: cDNA 
sequence junction between exon 1 of CTNNB1 and exons 2-5 of 
PLAG1. C: cDNA sequence junction between exon 1 of PLAG1 and 
exons 2-16 of CTNNB1 . D: cDNA sequence junction between 
10 exons 1-2 of PLAGl and exons 2-16 of CTNNB1 . 

Figure 7 

A: Northern blot analysis of the expression pattern of 
PIAG1 in normal human fetal tissues including brain (1) , 
15 lung (2), liver (3), kidney (4) as well as adult tissues 
including heart (5), brain (6), placenta (7), lung (8), 
liver (9), skeletal muscle (10), kidney (11), and pancreas 
(12). 

B: Northern blot analysis of the expression pattern of 
20 the CTNNB1 gene in normal human fetal and adult tissues as 
described under H A" . 

C: Detection of CTNNB1 /PLAGl transcripts in pleomorphic 
adenomas by Northern blot analysis using exon 1 of CTNNB1 as 
a molecular probe. Lane l, RNA of CG644 (t(3;8)), lane 2, 
25 RNA of CG580 (t(8;15)), and lane 3, RNA of CG682 (ins 3p21 
(8ql2)). The CTNNB1 /PLAGl fusion transcript is indicated. 

D: Using the same blot as under "C", detection of 
CTNNBl /PLAGl transcripts using a probe with specificity for 
the 3' UTR of PLAGl (probe KK64) . The CTNNBl /PLAGl trans- 
30 cript is indicated. 

EXAMPLE 3 

The discovery of PLAGl made it possible to identify 
PLAGl -related genes. The possibility that the human genome 
35 contains such genes was raised by Southern blot data showing 
bands (weak hybridization signals) . In this example, the 
identification of another member of the PLAG gene family 
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that might be of pathogenetical relevance is described, i.e. 
the PLAG 2 gene. 

To isolate cDNA clones corresponding to other members 
of the PLAG1 gene family, a cDNA library in A.ZAP was used 
5 (Stratagene) . As molecular probe, the complete human PLAG1 
cDNA was used. Screening of the cDNA library was performed 
according to routine procedures using low stringency hybrid- 
ization conditions ( Schoenmaker s et al. (1994)). This 
resulted in the isolation of a number of overlapping cDNA 

10 clones presumably corresponding to another member of the 
PLAG family. From the inserts of these, a 2.7 kbp composite 
cDNA could be constructed that contained an open reading 
frame (1233 nucleotides) for a protein (411 amino acids) 
structurally similar to PLAG1. This protein was designated 

15 PLAG 2 . The 2.7 kbp human PLAG2 cDNA fragment started 176 
nucleotides upstream of the ATG start codon. 

The nucleotide sequence of the composite PLAG2 cDNA and 
the deduced amino acid sequence of the corresponding PLAG 2 
protein are shown in Figure 8. Nucleotide sequences were 

20 determined according to the dideoxy chain termination method 
using the T7 polymerase sequencing kit of Pharmacia/ LKB. DNA 
fragments, subcloned in the pBLUESCRIPT or pGEM-3Zf (+) 
vector, were sequenced using standard primers and primers 
synthesized based upon newly obtained sequences. Nucleotide 

25 sequence data were obtained from both strands and analyzed 
using the sequence analysis computer programs Genepro (Ri- 
verside Scientific) , PC/Gene and Intelligenetics (IntelliGe- 
netics, Inc. ) . 

To established that the isolated PLAG 2 sequences cor- 

3 0 responded to expressed sequences, Northern blot analysis was 
performed according to routine procedures. Total RNA was 
extracted from various human tissues and cell lines using 
Trizol™ (GIBCO/BRL) . After precipitation, concentrations of 
RNA in the samples were estimated via spectroscopic determi- 

35 nation. 40 fig of total RNA from each sample was run on 1% 
agarose gels containing 1 x MOPS (0.02 M MOPS (Sigma) pH 
7.0, 50 mM Na-acetate, 10 mM EDTA pH 8.0), and 0.66 M for- 
maldehyde (Sigma) in a 1 x MOPS buffer for 16 h at 4 V/cm. 
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After gel electrophoresis, RNA was transferred to Hybond N* 
filters (Amersham) using 10 x SSPE (20 x SSPE: 3.0 M NaCl, 
0.16 M NaH 2 P0 4 pH 7.4, 0.02 M EDTA) for standard capillary 
blotting. After completion of the transfers, nylon filters 
5 were irradiated with ultraviolet light using a Stratalinker 
(Stratagene) . Prehybridizations of the blots were carried 
out for 3 h at 42 °C in hybridization buffer consisting of 5 
x SSPE, 10 x Denhardt's, 2% SDS, 50% formamide and 100 ^g/ml 
herring sperm DNA. For hybridizations, blots were incubated 
10 overnight in the same buffer as described for the prehybrid- 
izations. Subsequently, blots were washed for 40 min at room 
temperature in 2 x SSC containing 0.05% SDS and for 40 min 
in 0.1 x SSC containing 0.1% SDS at 50 °C. Autoradiographs 
were obtained by exposing Kodak X-AR film at -80 °C with 
15 intensifying screen. 

In Northern blot analysis using the complete 2.7 kb 
PIAG2 cDNA as molecular probe and rather stringent hybrid- 
ization conditions, four distinct mRNAs were detected. These 
varied in size and included mRNAs of about 4.4 kb, 3.1 kb, 
20 2.9 kb, and 1.9 kb. The 3.1 and 2.9 kb transcripts were 
preferentially seen in fetal tissue, whereas the other 
transcripts were seen preferentially in adult tissues. These 
results established the expression of the PLAG2 sequences in 
a variety of cell types. Furthermore, the detection of 
25 various mRNAs with the PLAG2 cDNA probe suggests the exis- 
tence of various isoforms and raises the possibility that 
various PLAG2 protein isoforms may be produced. 

EXAMPLE 4 

3 0 Detection of hybrid PLAG1 in salivary gland tumor cells. 

cDNA clones of the chromosome 3 -derived CTNNB1 gene 
were isolated and the nucleotide sequence thereof establis- 
hed. The nucleotide sequence data of a composite cDNA are 
shown in Fig. 9. The amino acid sequence of the CTNNB 1 — 
35 encoded protein was deduced. 

In 5' -RACE analysis of primary pleomorphic adenomas of 
the salivary glands, PLAG1 
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containing fusion trancripts were identified. 5 1 -RACE expe- 
riments were performed using total RNA of the primary tu- 
mors. A primer specific for exon 5 (MV2) was used in the 
cDNA synthesis. Following ligation of an adaptor to the 
5 cDNA, PCR amplification was performed using, in the first 
round, an adaptor-specific primer and a PLAGl -specif ic 
primer corresponding to sequences of exon 5 (MV5) and, in a 
second round, nested adapt or -specific primer and PLAGl- 
specif ic primer corresponding to sequences of exon 4 (MV6) . 

10 Nucleotide sequence analysis of the PCR product revealed 

that the ectopic sequences were fused to the acceptor splice 
site of exon 3 of PLAG1. BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1 . the gene for 
6-catenin, which has previously been assigned to chromosome 

15 3p21. 

EXAMPLE 5 

Diagnostic test for pleomorphic adenoma of the salivary 
glands 

20 Fine needle biopsies of patients having a pleomorphic 

adenoma of the salivary gland (with a chromosome 8ql2 
aberration others with a normal karyotype) were taken. From 
the material thus obtained total RNA was extracted using the 
standard TRIZOL™ LS protocol from GIBCO BRL as described in 

25 the manual of the manufacturer. This total RNA was used to 
prepare the first strand of cDNA using reverse transcriptase 
(GIBCO/ BRL) and an oligo dT(17) primer containing an 
attached short additional nucleotide stretch. The sequence 
of the primer used is as described in Example 2, under point 

30 2.6 [AAG GAT CCG TCG ACA TC(T)17]. RNase H was subsequently 
used to remove the RNA from the synthesized DNA/RNA hybrid 
molecule. PCR was performed using a gene-specific primer 
(Example 2, point 2.6) and a primer complementary to the 
attached short additional nucleotide stretch. The thus 

35 obtained PCR product was analysed by gel electrophoresis. 
Fusion constructs were detected by comparing them with the 
background bands of normal cells of the same individual. 
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In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 
primer complementary to the short nucleotide stretch [AAG 
GAT CCG TCG ACA TC(T)17]. The sensitivity of the test was 
5 thus significantly improved. 

EXAMPLE 6 

Diagnostic test for uterine leiomyoma. 

10 Uterine leiomyoma is the most common pelvic tumor in 

women, occurring with an incidence up to 77% of women of 
reproductive age when tumors are counted after 2 millimeter 
serial sectioning of consecutive hysterectomy specimens. 
Often multiple leiomyomas are present, with estimates as 

15 high as an average of 6.5 tumors per uterus. Although most 
patients with these steroid-dependent tumors are asymptoma- 
tic, symptomatic leiomyomas can be associated with abnormal 
uterine bleeding, pelvic pain, urinary dysfunction, sponta- 
neous abortions, premature delivery and infertility. The 

20 high incidence of this benign smooth muscle tumor constitu- 
tes a major public health problem with the diagnosis of 
myomatous uterus leading to over 200,000 hysterectomies 
performed annually in the United states. Furthermore, non — 
random inactivation of the X chromosome has clearly demon- 

25 strated that uterine leiomyomas are monoclonal proliferati- 
ons. Although leiomyomata typically are benign tumors, in a 
very low percentage (< 0.1%) they might be able to metasta- 
size or even progress to malignancy. 

Besides a normal karyotype which is being found in 

3 0 approximately 70% of the cases investigated, several cytoge- 
netically abnormal subgroups, can be distinguished among 
leiomyomata. Leaving out of consideration the group which 
shows random changes, one of the largest cytogenetic sub- 
groups (comprising approximately 25% of the cytogenetically 
35 abnormal tumors) was first described in 1988 and is charac- 
terized by the involvement of 12ql5 and/or 14q23-24, mainly 
as t(12;14) (ql4-15;q23-24) . Another subgroup, with a similar 
incidence, contains deletions involving the long arm of 
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chromosome 7, with region q21-22 being the most probable 
commonly involved region. Another subset of leiomyomas is 
characterized by numerical aberrations, mainly trisomy 12. 
This trisomy is found in approximately 10% of the cytogene- 
5 tically abnormal leiomyomas. Furthermore, chromosome 6p21 — 
pter, has been found to be recurrently involved in roughly 
5% of the cases studied. Finally, a small percentage (ap- 
prox. 3.5%) of leiomyomata shows t(l;2) (p36;p24) . In uterine 
leiomyomata with chromosome 12ql4-15 aberrations, the HMGIC 

10 gene is affected and in those with chromosome 6p aberrati- 
ons, the HMGIY gene is affected. 

The observation that 70% of the uterine leiomyomata 
tested show a normal karyotype prompted studies to investi- 
gate whether activation of a PLAG gene could be found in 

15 uterine tumors. Therefore, Northern blot and PCR analysis 
were performed. 

Specimens of tumor material surgically removed from 
patients and classified by routine pathology as uterine 
leiomyoma, uterine leiomyosarcoma, fibro leiomyoma, or uteri - 

20 ne fibroma were used. From the material thus obtained, total 
RNA was extracted using the standard TRIZOL™ LS protocol 
from GIBCO BRL as described in the manual of the manufactu- 
rer. This total RNA was used in Northern blot analysis, 
which revealed activation of PLAG1 in about 50% of the 

25 tumors tested, indicating that specific PLAG activation is 
also observed in these types of tumors. These results were 
confirmed by PCR analysis. These results reveal the possibi- 
lities of using PLAG1 as a diagnostic marker on routine 
biopsies taken from patients. Since PLAG1 is implicated in 

30 this type of tumor, PIAG1 can be used as a target for the 
development of novel therapeutic strategies for uterine 
leiomyoma . 

EXAMPLE 7 

35 Diagnosis of lipoblastoma. 

Lipoblastomas are benign pediatric tumors that are 
assumed to result from proliferation of primitive adipocy- 
tes. These type of tumors often grow rapidly but there are 
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no reports that they metastasize. Because of the rapid 
growth and their histopathological features, lipoblastomas 
can mimic myxoid or well-differentiated liposarcoma which 
makes the diagnosis problematic. As a direct consequence, 
5 therefore, some of these tumors have been treated in an 
unnecessarily agressive manner. 

A recurrent rearrangement involving the long arm of 
chromosome 8 has been reported for lipoblastomas. Southern 
blot analysis detected a rearrangement in the PLAG1 gene 
10 opening the possibility of using PLAG1 as a diagnostic 
marker. 

A biopsy of a patient having a lipoblastoma was taken. 
From the material thus obtained total RNA was extracted 
using the standard TRIZOL™ LS protocol from GIBCO BRL as 

15 described in the manual of the manufacturer. This total RNA 
was used to prepare the first strand of cDNA using reverse 
transcriptase (GIBCO/ BRL) and an oligo dT(17) primer contai- 
ning an attached short additional nucleotide stretch. The 
sequence of the primer used is as described in Example 2, 

20 under point 2.6. RNase H was subsequently used to remove the 
RNA from the synthesized DNA/RNA hybrid molecule. PCR was 
performed using a gene-specific primer (Example 2, point 
2.6) and a primer complementary to the attached short addi- 
tional nucleotide stretch. The thus obtained PCR product was 

25 analysed by gel electrophoresis. Fusion constructs were 
detected by comparing them with the background bands of 
normal cells of the same individual. 

In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 

30 primer complementary to the short nucleotide stretch. The 
sensitivity of the test was thus significantly improved. 

EXAMPLE 8 

PLAG2 as diagnostic marker. 

To evaluate the diagnostic potential of the PLAG2 gene, 
we have first identified on which human chromosome the PLAG2 
gene is located and subsequently established its subchromo- 
somal location. Such mapping studies could link the gene to 
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known tumor-specific chromosome anomalies. Using several 
different PLAG2 containing YACs and cosmids in FISH analy- 
sis, the PLAG2 gene was tentatively mapped to chromosome 
band 6q24, a chromosome segment implicated in various tumors 
5 such as for instance malignant salivary gland tumors. 

Malignant salivary gland tumors are a heterogeneous 
group of tumors comprising at least 17 different entities 
according to the most recent WHO classification (1991) . 
Because of the large number of tumor types and the wide 

10 morphological spectrum that these tumors show they constitu- 
te a well recognized diagnostic problem. The most frequent 
chromosome abnormalities so far observed among malignant 
salivary gland tumors are deletions or translocations invol- 
ving 6q. Most of the deletions so far encountered are 

15 terminal deletions with breakpoints located at 6q22-q25. 

However, there are also a few cases on record with intersti- 
al 6q deletions. The minimal common region lost in the 
majority of all cases is 6q24-qter. The 6q deletions are not 
restricted to certain types of malignant salivary gland 

2 0 tumors, but seem to occur in all major histologic subtypes, 

including adenoid cystic carcinoma, adenocarcinoma, undiffe- 
rentiated carcinoma, mucoepidermoid carcinoma and acinic 
cell carcinoma. The frequency of 6q rearrangements vary 
somewhat in different tumor types. In for example adenoid 

25 cystic carcinomas clonal 6q deletions or translocations have 
been found in nearly 50 % of the cases. The fact that the 6q 
deletions are often the sole karyotypic change, indicate 
that they are primary cytogenetic events of pathogenetic 
importance to malignant salivary gland tumors. In addition 

30 to 6q deletions a subgroup of adenoid cystic carcinomas are 
characterized by a t(6;9) (q21-24 ;pl3-23 ) . This is a primary 
abnormality that is diagnostic for this tumor types. It 
should also be noted that in addition to malignant salivary 
gland tumors there are several other tumor types with recur- 

3 5 rent deletions of 6q, including e.g. various types of leuke- 

mias and lymphomas, malignant melanomas and neuroblastomas. 
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Involvement of PLAG2 in malignant salivary gland tumors 

This example is performed to test whether the different 
6q deletions commonly seen in different histologic subtypes 
of malignant salivary gland tumors involve the PLAG2 gene at 
5 6q24. 

8.1 Material and methods 

FISH analysis using the PLAG2 containing CEPH mega-YACs 
798D8 and 921C2 was performed on metaphase chromosomes from 
10 two cases of malignant salivary gland tumors, one 

adenocarcinoma and one adenoid cystic carcinoma, with 6q 
deletions. The methods for cytogenetic analysis and FISH 
were the same as described for PJLAG1. 

15 8.2 Results 

Hybridization with both YACs on metaphases from the 
adenocarcinoma resulted in signals of the same intensity on 
both chromosome 6 homologs at the expected position. There 
was no evidence of deletion of PLAG2 signals in this case. 

2 0 In contrast, the adenoid cystic carcinoma contained a 

subpopulation of cells, which in addition to the signal from 
the normal chromosome 6 showed either no signal or a weak 
signal from the 6q-marker chromosome. Both YACs gave a 
similar hybridization pattern. 

25 

8.3 Conclusion 

Preliminary studies suggest that at least one of the 
breakpoints in the 6q deletions in malignant salivary gland 
tumors map in the vicinity of the PLAG2 locus. PLAG2 is 



3 0 therefore a candidate gene for the commonly ocurring 6q 
deletions in malignant salivary gland tumors. 

EXAMPLE 10 

Animal tumor models involving PLAG1 as tools in in vivo 
35 therapeutic drug testing 

On the basis of the acquired PLAG1 knowledge, animal 
tumor models can be developed as tools for in vivo therapeu- 
tic drug testing. To achieve this objective (for instance 
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for pleomorphic adenoma of the salivary gland) , two approa- 
ches can be used, gene transfer (generation of transgenic 
animals) on the one hand and gene targeting technology 
(mimicking in vivo of a specific genetic aberration via 
5 homologous recombination in embryonic stem cells (ES cells)) 
on the other. These technologies allow manipulation of the 
genetic constitution of complex living systems in specific 
and pre-designed ways. For extensive technical details, see 
B. Hogan, R. Beddington, F. Constant ini, and E. Lacy; In: 

10 Manipulating the mouse embryo, a laboratory manual. Cold 
Spring Harbor Press, 1994; ISBN 0-87969-384-3. 

To aim at the mutation of the PLAG1 gene, specifically 
in selected cell types and selected moments in time, the 
recently described Cre/LoxP system can be used (Gu, H. et 

15 alj_ Deletion of a DNA polymerase 6 gene segment in T cells 
using cell type-specific gene targeting. Science 265, 103- 
106, 1994). The Cre enzyme is a recombinase from bacteriop- 
hage PI whose physiological role is to separate phage geno- 
mes that become joined to one another during infection. To 

20 achieve so, Cre lines up short sequences of phage DNA, 

called loxP sites and removes the DNA between them, leaving 
one loxP site behind. This system has now been shown to be 
effective in mammalian cells in excising at high efficiency 
chromosomal DNA. Tissue-specific inactivation or mutation of 

25 a gene using this system can be obtained via tissue-specific 
expression of the Cre enzyme. 

As an example, the development of animal model systems 
for pleomorphic adenoma of the salivary glands using a 
member of the PLAG gene family will be outlined below, such 

3 0 that the models will be instrumental in in vivo testing of 
therapeutic drugs. 

Two approaches will be followed: 

a) in vivo induction of specific genetic aberrations as 
observed in human patients ((conditional) gene (isogenic) 

3 5 targeting approach) ; and 

b) introduction of DNA constructs representative for the 
genetic aberrations observed in patients (gene transfer 
approach) . 
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DNA constructs to be used in gene transfer will be 
generated on the basis of observations made in patients 
suffering from pleomorphic adenomas of the salivary glands 
as far as structure and expression control are concerned; 
5 e.g. PLAG1 fusion genes with various translocation partner 
genes, especially the preferential translocation partner 
gene CTNNB1 of chromosome 3, i.e. complete PLAG1 under 
control of a strong (salvary gland-specific) promoter. 

10 EXAMPLE 11 

The preparation of antibodies against PLAG-encoded proteins 

One type of suitable molecules for use in diagnosis and 
therapy are antibodies directed against the PLAG genes. Two 
approaches have been followed to develop them. Based upon 

15 the nucleotide sequence of the PLAG cDNAs, computer analysis 
was used to predict the location of antigenic determinants 
within the proteins. Synthetic peptides containing such 
determinants were prepared and rabbits were immunized with 
these. As an alternative approach, cDNA sequences can be 

20 expressed in appropriate pro- and eukaryotic expression 

systems and the polypeptides synthesized in this way can be 
purified and used for immunization of mice. Cell lines 
obtained upon transfection or electroporation of constructs 
of the PLAG genes were helpful in characterizing the antibo- 

25 dies obtained. 

For the preparation of rabbit polyclonal antibodies 
directed against the PLAGl-encoded proteins, use was made of 
the following three commercially available peptides: 
(H-DLSEVRDTQKVPSGKR) 8-Multiple Antigen Peptide 

3 0 (H-FSSTSYAISIPEKEQPL) 8-MAP 
( H-QLPTQTQDLQDP ) 8— MAP 

obtainable from Research Genetics Inc., Huntsville, AL, USA. 
The polyclonal antibodies were made according to standard 
techniques . 

35 with respect to PLAG2 . suitable antibodies were genera- 

ted using hybrid proteins as antigens. These contained 
various portions of the PLAG 2 coding region fused in frame 
to GST (glutathion-S-transferase) . 
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EXAMPLE 12 

Inhibition of PIAG1 expression in tumor cells using antisen- 
se PLAG1 constructs. 

5 X±. Introduction 

PIAG1 can be used as target in novel therapeutic proto- 
cols for tumors in which it is implicated; e.g. pleomorphic 
adenomas of the salivary glands, uterine leiomyoma, etc. 
Cancer as a genetic disease is a logical target for gene 

10 therapy, either by replacing the missing or inactivated gene 
or by suppressing the activity of an unwanted gene. The high 
affinity of short DNA sequences for their target mRNA has 
indicated that antisense oligonucleotides constitute valua- 
ble reagents to specifically suppress the production of gene 

15 products, both in vitro and in vivo . Applications have been 
documented in various biomedical research areas, such as for 
instance cancer research, virology, and cardiovascular 
research. The in vitro activity, toxicity, and pharmacok- 
inetic data of antisense oligonucleotides are encouraging 

2 0 and in vivo animal experiments demonstrating suppression of 
neointimal formation seem very promising. Also in the field 
of cancer research promising results have been obtained. 
However, potential nonspecific effects of antisense oligonu- 
cleotides should be carefully considered when antisense 

25 agents are used to define biological functions of specific 
genes . 

Since expression of the PLAG1 gene is frequently acti- 
vated or strongly elevated in a wide variety of tumors and 
tumor cell lines, derived from tumors (rhabdomyosarcoma, 

30 uterine leiomyosarcoma, uterine leiomyoma, malignant 

salivary gland tumors) as well as leukemias and lymphomas, 
it was speculated that the PLAG 1 -encoded protein might play 
a key role in transformation of cells. This example shows 
that expression of the PLAG1 gene can be strongly reduced by 

35 expressing antisense PLAG1 sequences and that reduction of 
PIAG1 levels in tumor cells results in reversion of the 
transformed phenotype. Thus the expression or administration 
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of antisense molecules can be successfully applied 
therapeutically . 

iL. Materials and methods 
5 2.1 Tumor cell lines 

Tumor cell lines were generated from primary tumors as 
described by Kazmierczak et al. . Genes Chromosom. Cancer 
5:35-39, 1992. 

10 2.2 Culturing tumor cells 

Tumor cells were propagated in TC199 culture medium 
with Earle's salts, supplemented with 20% fetal bovine serum 
(GIBCO) , 200 lU/ml penicillin, and 200 microgram/ml strepto- 
mycin . 

15 

2.3 Transfection assay 

Transfections were performed using various protocols, 
namely: 

1. The calcium phosphate precipitation procedure of Graham 

2 0 and Van der Eb ("A new technique for the assay of the infec- 

tivity of human adenovirus." Virology 52: 456-467, 1973.). 

2. Lipof ection: Transfections were carried out using liposo- 
me-mediated DNA transfer ( lipof ectamine, GIBCO/BRL) accor- 
ding to the guidelines of the manufacturer. 

25 

2.4 Antisense constructs 

Sense and antisense constructs of the PLAG1 gene were 
obtained by inserting human PLAG1 cDNA sequences in both the 
sense and antisense orientation in expression vectors under 

3 0 transcriptional control of various promoter contexts, e.g. 

the long terminal repeat of Moloney murine leukemia virus, a 
CMV promoter, or the early promoter of SV40. For example, 
the CMV/PLAG1 plasmid was constructed by cloning a human 
PLAG1 cDNA fragment containing all coding sequences of human 
35 PLAG1 in pRC/CMV (Invitrogen) allowing expression under 
control of the human cytomegalovirus early promoter and 
enhancer, and selection for G418 resistance. 
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3 . Results 

3.1 Inhibition of PIAG1 expression 

Inhibition of PIAG1 expression was observed in tumor 
cells after induction of antisense PLAG1 expression in these 
5 tumor cells. Immunoprecipitation and Western blot analysis 
indicated a strong reduction of PLAG1 protein levels in the 
cells expressing antisense PIAG1 sequences. Therefore, this 
approach can be used therapeutically in tumors with involve- 
ment with PLAG1 . 
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CLAIMS 



1. Gene being in an isolated form and implicated 
in tumorigenesis having the nucleotide sequence of any one 
of the strands of any one of the members of the PLAG subfa- 
mily of zinc finger protein genes or CTNNBl genes, including 

5 modified versions thereof. 

2. Tumorigenesis gene (T-gene) as claimed in 
claim 1 having at least homology with the zinc finger 
domains that are typical for the PLAG1 gene the nucleotide 
sequence of which is depicted in figure 4A, or the 

10 complementary strand thereof, including modified or 
elongated versions of both strands. 

3. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG1 gene as depicted 
in figure 4A, or the complementary strand thereof, including 

15 modified or elongated versions of both strands. 

4. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG2 gene as depicted 
in figure 8A, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

20 5. T-gene as claimed in claim 1 having essentially 

the nucleotide sequence of the CTNNBl gene as depicted in 
figure 9, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

6. T-gene as claimed in claims 1-5 for use as a 
25 starting point for designing suitable expression-modulating 

compounds or techniques for the treatment of non-physiolo- 
gical proliferation phenomena in human or animal. 

7. T-gene as claimed in any one of the claims 1-6 
for use as a starting point for designing suitable nucleo- 

30 tide probes for (clinically/medically) diagnosing cells 

having a non-physiological proliferative capacity as compa- 
red to wildtype cells. 

8. T-gene as claimed in claims 1-7, which gene is 
implicated in tumors selected from rhabdomyosarcoma, uterine 

35 leiomyosarcoma, uterine leiomyoma, malignant salivary gland 
tumors, leukemias and lymphomas. 
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9. Protein encoded by a T-gene as claimed in 
claims 1-8 for use as a starting point for preparing suit- 
able antibodies for (clinically/medically) diagnosing cells 
having a non-physiological proliferative capacity as compa- 

5 red to wildtype cells. 

10. Derivatives of the T-gene as claimed in 
claim 1-8 for use in diagnosis and the preparation of thera- 
peutical compositions, wherein the derivatives are selected 
from the group consisting of sense and anti-sense cDNA or 

10 fragments thereof, transcripts of the gene or practically 
usable fragments thereof, fragments of the gene or its 
complementary strand, proteins encoded by the gene or frag- 
ments thereof, antibodies directed to the gene, the cDNA, 
the transcript, the protein or the fragments thereof, as 

15 well as antibody fragments. 

11. In situ diagnostic method for diagnosing 
interphase and/or metaphase cells having a non-physiological 
proliferative capacity, comprising at least some of the 
following steps: 

20 a) designing a set of nucleotide probes based on 

the information obtainable from the nucleotide sequence of 
the T-gene as claimed in claim 1-5, wherein at least one of 
the probes is hybridisable to a region of the aberrant gene 
substantially mapping at the same locus as a corresponding 

25 region of the wildtype gene and/or the same or another probe 
is hybridisable to a region of the aberrant gene mapping at 
a different locus than a corresponding region of the 
wildtype gene; 

b) incubating one or more interphase or metaphase 
3 0 chromosomes or interphase or metaphase cells having a non- 
physiological proliferative capacity, with the probe (s) 
under hybridising conditions; and 

c) visualising the hybridisation between the 
probe (s) and the gene. 

35 12. Method of diagnosing cells having a non-physi- 

ological proliferative capacity, comprising at least some of 
the following steps: 
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a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating a suitable T-gene-related macromo- 
lecule therefrom; 

5 c) analysing the macromolecule thus obtained by 

comparison with a wildtype reference molecule preferably 
from the same individual. 

13. Method as claimed in claim 12, comprising at 
least some of the following steps: 
10 a) taking a biopsy of a tumor to obtain cells to 

be diagnosed; 

b) extracting total RNA thereof; 

c) preparing at least one first strand cDNA of the 
mRNA species in the total RNA extract, which cDNA comprises 

15 a suitable tail; 

d) performing a PCR and/or RT-PCR using a PLAG 
gene specific primer and a tail-specific and/or partner- 
specif /nested primer in order to amplify PLAG gene specific 
cDNA 1 s ; 

2 0 e) separating the PCR products on a gel to obtain 

a pattern of bands; 

f) evaluating the presence of aberrant bands by 
comparison to wildtype bands, preferably originating from 
the same individual. 
25 14. Method as claimed in claim 12, comprising at 

least some of the following steps: 

a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating total protein therefrom; 

3 0 c) separating the total protein on a gel to obtain 

essentially individual bands and optionally trnasferring the 
bands to a Western blot; 

d) hybridising the bands thus obtained with anti- 
bodies directed against a part of the protein encoded by the 

3 5 remaining part of the T-gene and against a part of the 
protein encoded by the substitution part of the T-gene; 

e) visualising the antigen-antibody reactions and 
establishing the presence of aberrant bands by comparison 
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with bands from wildtype proteins, preferably originating 
from the same individual. 

15. Method as claimed in claim 12, comprising at 
least some of the following steps: 
5 a) taking a biopsy of a tumor to obtain cells to 

be diagnosed; 

b) isolating total DNA therefrom; 

c) digesting the DNA with one or more so-called 
"rare cutter" restriction enzymes; 

10 d) separating the digest thus prepared on a gel to 

obtain a separation pattern; 

e) optionally transfering the separation pattern 
to a Southern blot; 

f) hybridising the separation pattern in the gel 
15 or on the blot with one or more informative probes under 

hybridising conditions; 

g) visualising the hybridisations and establishing 
the presence of aberrant bands by comparison to wildtype 
bands, preferably originating from the same individual. 

2 0 16. Method as claimed in any one of the claims 

10-15, wherein the cells having a non-physiological 
proliferative capacity are selected from the group 
consisting of the tumors pleomorphic adenomas of the 
salivary gland, lipoblastomas , uterine leiomyomas, and other 
25 benign tumors as well as various malignant tumors, including 
but not limited to sarcomas (e.g. rhabdomyosarcoma, 
malignant salivary gland tumors) and leukemias and lympho- 
mas . 

17 . Anti-sense molecules of a T-gene as claimed in 

3 0 claims 1-8 for use in the treatment of diseases involving 

cells having a non-physiological proliferative capacity by 
modulating the expression of the gene. 

18. Expression inhibitors of the T-gene as claimed 
in claims 1-8 for use in the treatment of diseases involving 

35 cells having a non-physiological proliferative capacity. 

19. Diagnostic kit for performing the method as 
claimed in claim 11, comprising a suitable set of labeled 
nucleotide probes. 
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20. Diagnostic kit for performing the method as 
claimed in claim 12, comprising a suitable set of labeled 
probes . 

21. Diagnostic kit for performing the method as 

5 claimed in claim 13, comprising a suitable set of labeled T- 
gene specific and tail specific PCR primers. 

22. Diagnostic kit for performing the method as 
claimed in claim 15, comprising a suitable set of labeled 
probes, and suitable rare cutting restriction enzymes. 

10 23. Method for isolating other T-gene based on the 

existence of a fusion gene, fusion transcript or fusion pro- 
tein in a tumor cell by using at least a part of a T-gene 
for designing molecular tools (probes, primers etc.). 

24. T-gene obtainable by the method of claim 23. 

15 25. T— gene as claimed in claim 24 for use in dia- 

gnostic or therapeutic methods. 



FLAG GENE FAMILY AND TUMOR I GENE SIS 
ABSTRACT OF THE DISCLOSURE 

The present invention relates to a novel gene 
family identified as T-genes and being implicated in 
tumorigenesis having the nucleotide sequence of any one 
of the strands of any one of the members of the PLAG and 
CTNNB1 gene families. The genes and their derivatives 
may be used in various diagnostic and therapeutic 
applications . 
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SUBSTITUTE SHEET (RULE 26) 



5/19 nr / a 
PLAGl cDNA sequence f~ U 4 A 



GGCAGCGCAT ACACTACAAT GGCTGCTGGA 
TTCCAGGCCC GCCGCGTCCA GCCCGAAATA 
ATTCCGCGGG CGGTGTAGAG GCGGCGGACG 
AGCCCCGCGG TTGCCTCTTG GTGCTGCCTT 
ATGCTTCATT CTGTGACGGT CTATTAATAA 
GAGCAGGGCC TCAGATTGGC CAAAATGGGA 
TCCACGAAGA GTCAATGGGA CTGG CTAAGA 
CATCAGTAAT CAGTCCCTTT TTG CTTTCTT 
AGAAGCCACC TAAAG CTATA TCATTTAGTG 
GTCCAACAAG AAGGCCTGGT TTAGGCTGCG 
TGATTTGTCA GAAGTAAGAG AT AC CCAGAA 
AGCGTGGTGA AACCAAACCA AGAAAAAACT 
AAGGCCTTTA ACAGTGTTGA GAAATTAAAG 
AGGAGAGAGG CCCTACAAGT GCATACAACA 
TTTCTAAGTA CAAATTACAA AGGCACATGG 
ACCCACAAGT GTAATTATTG TGAGAAAATG 
GAAGAATCAC CTCCATACAC ACGACCCTAA 
AAGAATGTGG CAAGAACTAC AATACCAAGC 
GCCTTGCATG CCGCAACAAG TGGTGACCTC 
AACTTTTGAA AG CACGGGAG TGCTTCTGGA 
GCAAGTCGTC TGGTGGGGTT AAAGAAAAAA 
GATCGCCGGT TCTACACCCG AAAGGATGTC 
CACTGGAAGA AAGGACTTCC TCTGTCAGTA 
GAAAGGATCA CCTGACTCGA CATATGAAGA 
CTGAAGGTCA AAACAGAACC AGTGGATTTC 
TGTGTCTGTG C CTAT AAAAG ACGAGCTCCT 
CCAGTGAACT GTTATCAAAG CCATTCACAA 
T ACAAC ACT C CATTTCAGTC CATGCAGAGC 
GATCACAACT TTACCTTTGG GAATGACATG 
TTCATCCCTC TCACCACCTT TCTTTCAAAT 
TATGCAATTT CTATTCCTGA AAAAGAACAG 
GAGTTACCTG ATGGAGTTAC AAGGTGGCGT 
CTCAAGCATC GTCATCATCT AAGCTAGGGT 
CTAGATGATG GTGCAGGAGA CCTCTCCCTA 
CAGTGACCCC CTAAACACAC CAGCATTGGA 
TCATACCTTT AAATGGTCCT CCCTATAATC 
GGAATGAGCT ATTCCCAGGA AGAAGCACAT 
CACACAAACA CAGGATCTTC AGGATCCTGC 
CTCTGCACTC ACTGTCAGCA GCTTTCACCA 
ACCCTCCCAC GTTTCCATCA AGCTTTTCAG 
TCATTACAGA AATGTATGTG TAGCTGTGCC 
TAGTG CCTAC TTTAAAACAG TATAAAAATT 
ATTTTCATTA AGC CAGTAT A AAATAGAAAC 
GGAACCATTT GTGTTCAGTT AAGTTTACCT 
CTGCCAATTG TCACATTTTA AGACTTTTTT 
ATTATTAGTA GTAAACTTTT ACAAATCCCA 
CTTAAAATTT TCATTTTTGT CTAATAACAG 
TGGCTCATTA AAAAATTTAG CAATAGAATG 
TGAATAACTC AAGGGTTTAA ATTTTCTTAC 
AATCAAGTGC TTCAAATGAA TTAAGAGTCC 
GTTTGTTAGA TACACCATAA TTTCAGATCA 
GTTGTCTGGC TAAAATATTT GCCATCTTTA 
CAAACCCTAA ACACAAAGCA TCAGTATTTA 



AAGAGGCGTA AGGAAACAAT 50 
TGAGAAAAAA ATTATTAGAA 100 
GGCCGGAGGG AGGATGTTAA 150 
GGCCGTATTT GGCACCCAGA 2 00 
GGTTGCCTTG CTAGAGTTTG 250 
AGGATTGGAT TCCACTCTCT 3 00 
TCAAAGTCTG AGGCTTTTTC 3 50 
TTACGACCAC ATGAAACTTG 4 00 
GAGTTGGGCA GTTCCCAAGT 450 
ATGGCCACTG TCATTCCTGG 500 
AGTCCCTTCA GGGAAACGTA 550 
TTCCTTGCCA ACTGTGTGAC 600 
GTTCACTCCT ACTCTCACAC 650 
AGACTGCACC AAGGCCTTTG 700 
CTACTCATTC TCCTGAGAAA 750 
TTTCACCGGA AAGATCATCT 8 00 
CAAAGAGACG TTTAAGTGCG 850 
TTGGATTTAA ACGTCACTTG 900 
AC CTGT AAGG TATGTTTGCA 950 
G C AC CTTAAA TCTCATGCAG 1000 
AGCACCAGTG CGAACATTGT 1050 
CGGAGACACA TGGTGGTGCA 1100 
TTGTGCACAG AGATTTGGGC 10 50 
AGAGTCACAA TCAAGAGCTT 120 0 
CTTGACCCAT TTACCTGCAA 1250 
TCCGGTGATG TCCTTACCTT 13 0 0 
ACACTTTGCA GTTAAACCTC 13 5 0 
TCGGGATCTG CCCACCAAAT 14 0 0 
CCCAATAGAT ATGGACACTG 14 50 
AT C CGTTCAG TTCTACCTCA 150 0 
CCATTAAAGG GGGAAATTGA 1550 
GCCCTCTTCA TCCCAAGATT 16 00 
TGGATCCTCA GATTGGGTCC 165 0 
TCCAAAAGCT CTATCTCCAT 170 0 
TTTTTCTCAG TTG TTTAATT 175 0 
CT CTAT C AG T GGGGAGCCTT 180 0 
TCTTCTGTTT CCCAGCTCCC 185 0 
AAACACTATA GGGCTTGGGT 1900 
GCAGTTTAAG CACAAGTACC 195 0 
TAGGATTCTG GGACATGGAT 2 00 0 
CTAGATGACC ATTTTTATTT 2 05 0 
TCTGCTTTTG TATAATACAA 210 0 
TAG CTTTTAA ACTGAG CTTT 2150 
GGGTATTTTG TCCTGATTCA 22 0 0 
TTTTTCCATA TAG G AAAG CC 2 25 0 
TTTTCAAATT ACTTTTAGAT 23 00 
TGGCTCTACC TTTTGACATC 23 5 0 
TAAATTGTAT AAAAAGTTTG 24 00 
TAG CTT CTAA ATGGATTAAT 24 50 
AGTTTCGGAA GATAATAAAT 25 0 0 
GTATATTCTG AAGACTCTCT 2 550 
TTATGAGCCT TTAAGGAAAA 2 60 0 
TAGCAAAAAG AGACTCTGTT 2 6 50 
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AGGTGACATG GCATTTCGTG TCACTTAATA GTTGGCCCTA AATTAGTACA 27 00 
CAGGATATTT TGTCGTGTTT CATCCTTCTT AACATG CTAT CTTTTCATTT 2750 
AATAATAGTA ATAGTGTATG GCATTGGGGT CTTCAGAGTC GATATATAGG 2 800 
TAGATCTCTT TAGTCTTTTC CACCTTTCAC ATCCAAGGGG TGGGTCAAGT 2 650 
GCAGCCAGCA ATTTATTTTC ATTGTTGGCC CACGGTTAGT CCATAATCTA 2900 
GAGCCATTGT GGAACTGCAG CCATGAGGTG TGTTTATCCC ACAGTGGATT 2950 
GACTCAGCCT CTGTGGGTGA CAGACTTCTA AGCAGGAAGA TAGACGTGAA 3000 
GCACATGGTT ACATTTGGGA ACTTGTGTAG GGATCATGGC CCCTGTAGCC 3 050 
AGGGTTAAAA A CTGGACTTT TTAGAAGTAA AGTAAAAGCA TAKCGCTTAT 3100 
ATCATTTCTT GCTGAATTTG ATATGTTTTT CTTTCCCTTA AGAATCAAAA 3150 
GCAGAAAACA AAAACAACAG TCCTACTCCG ATGTTAT CTT TCTGATTCAA 3200 
TGTGAATCCA TCTTTCCTTG CAATATTTTG GATGGAGAAT TTGAAGTTAA 3 25 0 
ATG CATTAGA AAACTACCTG ATGAACTACC ACAAAGTTTT AAGTGACTAG 33 00 
AAATATATAC AGTAAAATCC CACTTTCATG CATCTCTGGG AAATGATAGG 3 3 50 
AGTATTGCAA ATAAGTTGAG TTTGTAGAGG GTAACAAAGT AAAGTAAAAC 34 00 
AAACCTATCT TGGTTAACAT GAAAATAACA ATTGAGAATA TATTATATTC 34 5 0 
ACTGAATAAT TATAGGCTTT T CCT CACATT AGACAACCAA CATAATCTTC 3 500 
TTAAAGGTCT AATTAATATA TTTTTCTAAG GGTCAGTTGG GACATTAACC 3 550 
TAAGAAACAT ATCTATTAAG CACTTGTTAA CACCTTATTT TAGGACCCTT 3 6 00 
TCCGTTGGGG ATGGGGGCAA GGGTGGGAGG TTTTTAGAAG AGTATATATC 3 650 
TCTTTAAAAA AAAACAGAAA GAAAAATATT TCTGAGCACT CATTAGCCCT 3 700 
ATATGGAAAC TTCTTTCCTT TTTGTAGGGC CAGTTATCAC TGCAGATTGC 3 750 
AATGTTTACC AAGAATTTCT AAAAATGAGT G C AG ATT ACT GAATATAATA 3 8 00 
CATTATTTAA AATATTTGGG AGTAGTATAA TTTGTTGAGA AATGTAAATT 3 8 50 
GTAATAATGT AAATGGGGGG CTTCAATATA TATATATAAT ACACACACAC 3 900 
ACACACATGC AC ACAT AC CG CACTTCATAG AATCAAAGTT GCTCTCTGAA 3 9 50 
GGAGCTTTGG CTCCTGATAT TTTATCATGC TCCTATATTT TTTTAAT CCT 4O0 0 
TGGAGCAGTA GTTTTTATAC TTATGTATTT AAATTTTATT ATGAAAAATT 4 05 0 
ACATTTATTA AAAAAGTGTG TTCCAAAGGC ATTAAAATTA TAT ATG TT AA 4100 
TAAGGAAGTA CATTTTTAAA TTTTTCAAAC TGCTCCTAGC TTTTGATTAG 4150 
GAGAATATTT TTTCTGAAAG TAGGCTTTTC GCTCTGCTTC ATTACTGCTT 4 2 00 
CCTTTAGTTT CTATGAAACA GATTGCTTAC CTAAAT CTTT AGTTGAATGA 4250 
TTAGTGTT C A ATATTGCTTT AATCACCATA TAAAAGGAAA AAAATTGGTG 43 00 
ACAGAGCACA AATAGAAAAC CTATTTTTAA AT AG AAAT C A CAAATAGCAA 4 3 50 
GTGTGGAAGC ACTACTTTAT TCTGTTTAAA ATGTACTTAA GAAGTCATCA 44 00 
AATTAGTGAA CTGAGACATT GGCCTTAGTA GGCTGTATTC ACTGCTAATT 4450 
TAAAAAAGGG AGTACCAGGA TTTATTAAGT AAAGCATTTT GGAAATGGGG 4 500 
AATAGCGCCA TATATGTATG TATGTGTATG TGTGTGTGTG GTGTGTGTAT 4 550 
ATATACACAC ACACATACAT ACTTAAATCT TGCCCTGCAT GAAATTCAAA 4 6 00 
TACATGGAGG CACATCTTCA GGGCACCAGT GTTAAAATTT TGGAGTCTTA 4 6 50 
ATTTTCATGT GTACACCTCT TTGCCTGTTC CCACCCCCAG ACTTGAAATA 4 700 
ACACTTCAGA GTAAGAGGGA ATTCAGCTAA TTTGTTTTTA AAATTGACTG 4 7 50 
TAGTGGTCAC TAAACCCTTT TTGAGAGAAT TT CTATTAAA GATGAGGCAG 4 8 00 
ACTCGCTTAT TTGAATTGCA CAATGTTCTA ACAAGGATGT AACACAGAAT 4 8 50 
TGGCTTTTTT TTCCCTAGAA AAAGATTGTT TGTTTCTATG TCAACTAGAT 4 900 
ATGATTAAAA ATAAGTATTG CCAATGCTGT TTTCATTCTC TAGTGGCCAG 4 950 
AATCATTATC CTTGAAATTT CTGGTAGTGC CTTAGCTTGG TTAAAAAAAA 4 500 
AAAAAAAAAA AAAAAAAAAG GGATTAACAT TAAATAAAAG TAG TTTAG AA 4 550 
TTTGGGCCTC AGACAAGATA TTGAACCTCA TTCAGTTTCA CTTCCACATG 510 0 
TATGTACAAG TTAGGTCACC AAACACGGAA GTTGAGTGTG GAAGGATCTT 5150 
GGCACTGTAA GCAATGCTAT CCATTGATGT ATACAAGTAC CTTTATAGTT 52 0 0 
AT CGATCACT GTTAAAACTT T CATTTT AAA ATCCTATTAC CAAGTTCAGT 52 5 0 
TTTTTAAAAC TTCAATTGTC CTGGCTGATT ATGCATCACT CTGTGTGCAA 53 0 0 
CTTTTTTATT TCATTTAGTG TTTCTTTCAA GCTGTGTATT TTTGCCTATT 53 5 0 
TGTTGCTTGT GCTTTATTTT TCTTAGTCAT TTGTGGAATA TAGTGATATA 54 00 
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TTGTGTTAAT TTGGACAGTA GCGGTTTTTA AAAACCATAT ACTGACTGAA 5450 
ACATGAGCCA GAGCCGATTG CTTTATTAAG CTAATAATGA ATGTTAAAGA 5500 
GTACATATTT TCAGGATCGT TCATCTAGTG AGCAATACAC ATATTATAGG 5550 
CCAATATTTT TTTAAAAAAT AGAGCTTGGT CAACCTCTAT ACTACACATA 5600 
TTACAAGATA TAGCACTTTC AAAATGAATC TAAAC CTTT A CAGAAACTTT 5650 
CTTATAGGTT ATGCCTTTTA TTTTAAGACT TATTATAATT CAAGTGCCAT 5700 
TAGATGATAT ATATGTAGGC CTTTGATATA TAATG CTTTG TGTACAAAAA 5750 
TGGTAGATGG TATTTTAAAC AGGTACATTT TTACAGTGTT TTCTTATCAA 5800 
TTTGCTATAT TGCACAGAAT CAGTGTGTGT CTTTTCATAA GGTTTTACAA 5850 
TGGTTTATTT TTTTACAAGG TTTACGTGTC TCAAAGCACA CTGTCTTCCC 5900 
AGTACGTAAG TTAAAAAATA CCAGTTCACC CAAGTTGCTT CTAGCCTACT 5950 
GAGAT CCATG TGACATTGGA GGAGATCTTT TAAATGTTTA GTATTCGTCA 6000 
TTAGCAATGG CTGGCTGTTA GTTCTGGTAA ATGTGTGCCT AAGTTGAATT 6050 
TGTCTTGTTT TTCTCACACT GTGTCAGCAG CCATGTCTAC AACACAGATA 6100 
AGTCTGTTGT GATCACATAG ATCTACATAA GTTGTGCAGT TTTGTGCTAA 6150 
AAACC CATAG GGAGCTCCTT TGGGATCATA GAAAAGAAGA TCATGCAACC 6 2 00 
AGCATTGGTG AAGGCACACT CAGATTGCAC TTAGGGCCTT TCTATGATGT 6250 
TGTCAACCCT CTGAGGATGG AAGGCAGTGT CTTTTGATGT TATCTAGCCT 63 00 
AGAAATGACA CAGAACTATT GCTAATGTAT AAAACACTTC ATTATATAAG 63 50 
CTTCAGTGGT ACAGATGAAC CAGAATGAAT GTTTATCTTC T CAGAAACAC 64 00 
TCCTT CAATA TTATATTGGA TCATGCTGCT AATGTAACTT GGGCTACAAC 6450 
TCTTCATGGT GCTACAAACT TCTCTGTCTC ATTCAGTCGT ATTTTTTTAT 6500 
CCATAGAAAA AGGACTACAT TAGGTGTAAA AGTGTACAAT ATATTTTTAT 6550 
ACTGTGACTT AATTTGT CAT TAACAAACTT TTACACCACC ACAATGTATT 66 0 0 
CATGTGCACT TGCAAAAGGA GATCTCGGAC ATGCAAATG T TACCAGAACA 6 6 50 
AACCCAGCTT TTGTCCACAA GGTGACTGTA ACTCAGAATG GAAAGTGGGC 6700 
TTTATAATAG GGTGTGGAGT GAAGAACATG CTGTATGTTA CTAACAGCCC 6 750 
TTTGAATTTA ACAAAAACTG GGAATCCATT AGGAAACGGA TTGCATCATA 6 8 00 
CCTGAACATA AGCTGGACTG CTGAAATTGT ATTTTT AG CT AATGAAAAAG 6 850 
TGTTTGGACT AGTACTCTAA AAATGTTCTA ATGATAAAGT TTTGAGTCAA 6 900 
AATAGAAAAG AAAAAAATCT GCATTCCAGG CCGAATTTTG TATATTTTTA 6 950 
TTGCATTTAA AATTGCTATT C7GTAATATT GGGAAATCAA GTGGCTTATC 7000 
ATGTATATCG TGTACTTAAA ATGTATTCAC AAACTACTGT TGTATTTGTA 70 50 
TAAAATATAG ACAAAGATCA TATTTTTTGT GTGTGTATAA GCTCTGTAAA 7100 
ATAGCAATCA CATTATGAAG CTG CAGTG AT ACTACATTTT AAACATTCAC 7150 
ATCCAAAGAA GCAGACTATT TATTGTCCAT ATACCAGATT TAAAATATT A 7200 
ATTTGCTGCT AATTAAATAA TAGTACTGCA GCTTCTTGTG GCCTACAGTG 72 50 
TTATGTTTGC TGTAAGAATA AGATATGTGA ATTCCACAAA ATATATGAAT 7300 
AAAATCTCGT GCC 7313 
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FIG. 6 A 
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STSs used to generate the 300 kb cosmid contig mapping 
at chromosome 8q12 and encompassing PLAG1 



STS CHI 29 

GAATTCTAAAACCATTTATAAATCATACTGAATCCCAGAACAATATATTTTAAACAACTTAA 
AAAAAAGAACAAAATAAAATAGCAAAACATTTTTAAGAGTGTAGATTCTTTGAAATTAAAGG 
ACATACTTACCCTGTAGT 



STS CH280 

GAATTCTTGCACCGGTTTTTTCTTATCAGTGTGGGCTGATGTTCCATTAACTGTGGTGTAAT 
TTGAGTATAGTCACTGACTGATTCTAGATATTTTCAGAGGGTCAAGACTTTTTCTAAGACCT 
TTATATGTGGTTGAATTCTTGTTCTTGGTTTCACAGAAGGTATATTAGCAAAGCATTTTTGG 
TGTTGAAGCTTGGTCTGTGATCTAGT 



STS CH33 

GAATTCGTTTTTATTTGACAAGCACATGAAGCCTTATCAGACGGAGGCCTCAATCCTTTGGC 
TGGGGTTTATAAGCAGGTAGCGCTAGACCTTCCCATTCTACATAAGCTGATGGGCACGGTAA 
TAGCTGGGGGTTTTCTCACAAGTCAAAGACAAATTGTCTGTTTTCAAGCGTGTGAAACAGTT 
wAAWACGTTTGAGGTCTCTCTCTTGCTTCATAGGCCATCTTGGCTCAGACATTCTACAGMCA 



STS EM156 

TCTG AG C AAC AAG AG C G AAACTCC ATCTCAAAAT AT AT AT ATAT AT AGGTAATTGTTGTCAT 
TAATATTAATGTAGTAGCAGCAGCAACAGTCATGGTAGCAATATTGCTCTATTTGGGAGGCA 
ACTTATAATTATTAACTGTGGAATATCTTTGAAAAATGTTTTTNGCAGAMGTTATGTTCCCA 
TTCCTGACTGGMGCTCATTATAAATACCCATCTTCTCTGAATAGCGCAAGGACTTTTGAAAA 
AGTGTTCTGAGTAAAC 



STS EM195 

ACAATCAATTTTAGAAAGTAATCATTTCATTACCCCAAACTGAAACCCTGTACCTGTTAGCA 
CTCACTCCCCTTTTCATTTTACTTTTTATTTATTTATTTTTTTGAGAGAGACTTGCTCTATC 
GCCCNGGCNVCAGTGCAGTGGCACAaTCTCAACTCACTGCAACCTCTGCCTGCCAGGGTCAA 
GTGATTCTTGTGCCTCAGAGTCCCAAGTACCTGGGATTACAGGCATAAGCCACCACGCCTGG 
CTAAATTTTGTATTTTCAGTAGTGACGGGGTTTCACCATGTTGGCCAGGCTGTCTCAAACTG 
CTGACCTCAGGTAATCCACCCTCCTCAGCCTCCCAGAGTTCTGGGATTACAGGCGTGACACC 
GTGCCTGGCTCATTTTATTTTTTAGAGATGAGATCTCACTCTKWTTGCCCAGGCTTCAGTGC 
ATTGGCGTCATGATGGCTCACTGCAGGCTTCAGCTCCTGGCCTCAAaGCATCCTTCCGCCTCA 



STS EM208 

CTAGGCGACAGAGCAAGACTCTGTCTCAARGAAAAAaAAAAAaVRAAAAAAATTACCAAAAC 
TGACTACAGAAAAVHGVARGGTTGAATAGCCTTACATTTGGVAAATAATTTTTATTTATAAT 
TAAAGATATTTTTATAAAAWTACTCTAGGCCCATAAGGCTTCACAGGTTAATTGTATTAAA 
TATTTAAGGAAAAAATAATACCAATCTTATTCATAGTCTTTCAGAAAATAGAGGCGTATCCA 
TTTTTCTAACTCATTTTAAGAAATAGCATCATTCTAATATCAAAAGCAAACAAGGMCATTGC 
AAAGAAGAAGGGGAAGAAGGAAGAGGAAGAGGAGGAGAAAGGGAAGCAGGAGATGGAGAAGA 
AGGAAGCCAGGTACAGTGCAATATTTCTCATGAACATAAACACAATTTTTAAAAAGTATTAM 
CAGGCTGGGCTTGGTGGCTCTCACCCGTAATCCCAGCMCTTTGGAAGGCCcAAGGCGGGTGG 
GHC AC AAGGTC AGG GGTTCGAG 
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FIG. 10 (continued) 

STS EM216 

TACTRACTGCTGTGCAGTTBTCCTGCAGTCAGTTCCAGAGGTCATTTCTAACGTTGCACTAT 
GGGKCTATTTAATAGGTTTCCTAAAGAACAAACATATCTCTTTAbAGTTACTCAGAGGGTAC 
ACAATGATGATGTCACACAATTAATTACCTATTAAGACTGAAATCCAGCAATGCATAGKGTG 
TGGACTTTACGCACATCCAGAAAAAGTTCTAGCACAAATTGTTTTHGTMTYATATATTTCAG 
AAGCCATAGAAACACTATTAAAGCCCTCCCTAATCACTTAGGGATGCAAAaTCAATAT 



STS EM317 

GACCAACAAAGGCACACAAAGATTGGTTGCTTTCTGAAGAATCTAAAAATGGCATTGGGTAT 
AGGAGTTGGGGAAGCAAGTTGTATAGGCACCTACACTTAAGATAATTTGTCAATTATACAAA 
TAATTTTTAAAGTTT AAG C CC CTTTCTGAC ATGACACGTCCATGGGTCCTTCACCCTTYT t K 
KTCTCCTSCAGAGCTCCAGTCTGCCYYTTYTTKSCTCTGAGCTCCAAAAMCAGTGAWTCCCC 
TGAAGTTACCTAGMCCCMCCATACAGTTTGTGACTCCCTAWMCcGGGGGTACCyTCCCATGY 
CTGGCTAATAyTGABTYTTGTDTACCGTGGCTTCTGTGTTACTACATTTGTTTTARTGGAAT 
TWATwAArGGGAAGCCTATCAA 



STS EM416 

GAGCAACTGAaCDNAGATTGGGTGAGGTAAGATGTGGGCTGCACAGGTGAGGCTGGAGAGGT 
GGGGAGTGCGTCCCAGTCGGGGGAGAAGAAGAAAAGGGCAGACTAGGGTAGAAATGCTTATW 
ACTcCTGTGACTGGAGCTGATGGTGTCTTAAGGAAAGTGGTGGGAAGGGAGGVCTGCAGAAA 
GGCAAGGCTGGAGTCGACTGAAGGCTGGAGAGCCACTGCTTTAACAAGTGTAMCTGGAGATG 
GAAGGGGCTGCAGGACAGGTCACTCAGCCAGTKGTGTGGARGCAATCTCACC 



STS EM443 

TTGATATTTGTTCTAACTCCACATTAACTATTGACAAATACTCTAAATTGTAGCTACCATCT 
GTTACGTAGCTAGCAGGTACCCTAACAGCAATGGGTCAGCTTTTGAGTAGCGTTTCAACCAT 
GTTACCTCGAGTACGGTGTGGTGAGGCCAGACGCAGATGGAGAGAAAGAAACAGAATCGAGC 
ATTTCCATTTTGTTTTGCTCACAGTCCCCAGGGGCAAACACAGCACAGCCTACAGGACCATG 
AAGGGGAGCACTGGGGTCACTCATGAAGCAGGGAGGTCGGGCCAGTGGTGGGGGgCCTTTAT 
GTGTTTTCCTCAGGAAGGAATGGGCAAGGCAGGGTAAGCATGTTCAGGACTGGTTAATTTGA 
ATAACTTCAGGGGGgCTCTAGGGCCTGgRGGCTGCCCCTGGTTGTCTGGTACCYgGSCCTG 



STS EM46 

ATATCAATCTTGGGTCTATGTATGTTTTTGCTTTTCCCcAGTGTTCCAGGCATGATGCTAAG 
GATATAGTGGATGATGAAATATATGCTTGCTGAATATGGGAATAAGAATTATTTTATGATCA 

CTCAGCTCACWGCAACCTCTGVCTCCTGGGTTCAAGTGATT 



STS EM47 

GTAGAGACACACTAGGCATGCACAGACCAGTGCAGAATGAACAATATTTGTTACATGTGTAG 
TTCTTTATGGTTTACAAAACTCTCCCAGCCATTATCTTCTTTCAGCCTTATAAAAGACAGAG 
CATATTTTATTATCCTCATTTACCTWHTCTAGTAAGGCATTTTTTCTTTTTTTCTTACTAGA 
GATATAAGGCTTAGGAAAAAAGTGAATACTACGATAAATGAATACTAGGAAAAGACATCACA 
ATCACAAATTTATTAATATCAGAAAACAGDTTTTAAGAATAAAATWTTCAAWAARgAAA 
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FIG. 10 (continued) 



STS END 2 

TAATTTATCACTACGGAATTCTGTGCAGTGAGATCAAAGAGCTGTGTATGCCCATAATGTGA 
TTTTACAGCCATTTTGTAAAAACTGTAAAATACCTTAATATTCAATTTGGCTTAAGGTACAT 
TGAGGACTTCTGGTTGAAAATTACAGAGTGGTGAAGATTC 



Known STSs 
PENK 
D8S285 
MOS 



STSs part of PIAG1 

EM265 

KK64 

KK63/EM209 
KK55/CH283 
EM224 
EM387 
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